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1  Preface 


Swirling  and  Swimming  in  Turbulence  was  the  theme  at  the  2010  GFD  Program.  Professors 
Glenn  Flierl  (M.I.T.),  Antonello  Provenzale  (ISAC-CNR,  Turin)  and  Jean-Luc  Thiffeault 
(University  of  Wisconsin)  were  the  principal  lecturers.  Together  they  navigated  an  elegant 
path  through  topics  ranging  from  mixing  protocols  and  efficiencies  to  ecological  strategies, 
schooling  and  genetic  development.  The  first  ten  chapters  of  this  volume  document  these 
lectures,  each  prepared  by  pairs  of  this  summer’s  GFD  fellows.  Following  on  are  the  written 
reports  of  the  fellows’  own  research  projects. 

This  summer’s  fellows  were: 

•  Michael  Allshouse,  MIT 

•  Emma  Boland-Thompson,  University  of  Cambridge 

•  Renske  Gelderloos,  KNMI,  Utrecht 

•  David  Goluskin,  Columbia  University 

•  Georgy  Manucharyan,  Yale  University 

•  Woosok  Moon,  Yale  University 

•  Kiori  Obuse,  Kyoto  University 

•  Amanda  O’Rourke,  Princeton  University 

•  Sam  Pegler,  University  of  Cambridge 

•  Anubhab  Roy,  Jawaharlal  Nehru  Centre  for  Advanced  Scientific  Research 

In  2010  the  Sears  Public  Lecture  was  delivered  by  Professor  Jim  Murray,  of  Oxford 
University  and  the  University  of  Washington.  Jim  challenged  us  to  use  mathematics  in  real 
world  problems,  citing  examples  from  biology  and  sociology.  Specifically,  Jim  argued  how 
one  could  exploit  mathematical  modelling  to  assist  in  the  prediction  of  tumour  growth,  and 
to  gauge  the  successfulness  of  marriage  partnership.  Over  a  hundred  listeners  filed  into 
Redfield  for  the  occasion,  and  we  enjoyed  refreshments  in  the  evening  air  afterwards  at  the 
back  of  Redfield. 

Neil  Balmforth  and  Jean-Luc  Thiffeault  acted  as  the  co-directors  for  the  summer  (once 
Jean-Luc’s  Lectures  were  over).  New  GFD  Faculty  member  Colm  Caulfield  (University  of 
Cambridge)  nimbly  acted  as  assistant,  as  well  as  putting  in  a  stout  performance  on  the  Staff 
by  supervising  a  good  fraction  of  the  fellows.  Anders  Jensen  worked  his  usual  magic  in  the 
Lab,  and  Janet  Fields,  Jeanne  Fleming  and  Kathy  Ponti  smoothly  ran  the  program  behind 
the  scenes. 
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Lecture  1:  Stirring  and  mixing 

Jean-Luc  Thiffeault 

Notes  by  Sam  Pegler  and  Amanda  O’Rourke 
21  June  2010 


1  Introduction 

Consider  a  patch  of  dye  immersed  in  a  flowing  fluid,  confined  within  a  spatial  domain  0. 
The  concentration  of  the  dye  9(x,t )  can  be  modelled  by  the  advection-diffusion  equation 

c)Q 

—  +u-V9  =  kV29,  (1) 

where  n  is  a  coefficient  of  diffusivity  and  u(x,t )  is  a  prescribed  velocity  field.  By  using 
equation  (1),  it  is  assumed  that  the  concentration  is  advected  precisely  with  the  flow,  and 
is  also  subject  to  molecular  diffusion  by  which  the  concentration  is  allowed  to  gradually 
spread  to  adjacent  fluid  elements.  The  velocity  field  in  equation  (1)  is  assumed  to  have 
been  determined  a  priori  and  satisfies  the  incompressibility  condition 


V-w  =  0.  (2) 

We  also  assume  that  the  boundary  conditions 

n  ■  u  =  0,  (3) 

n  ■  V0  =  0,  (4) 

apply  on  the  boundary  of  the  domain  <9R,  where  n  is  the  unit  outward  normal  to  the 
boundary.  These  conditions  ensure  that  no  fluid  penetrates  the  boundary  of  the  domain 
and  that  no  concentration  is  allowed  to  diffuse  into  or  out  of  the  domain,  respectively.  (We 
can  also  use  spatially-periodic  boundary  conditions,  which  ensure  that  there  is  no  net  flux 
in  or  out  of  the  domain.) 

We  proceed  to  demonstrate  some  general  properties  of  the  system  defined  by  equations 
(1)— (4)  by  deriving  evolution  equations  for  the  spatial  mean  and  variance  of  9.  To  do  this 
we  begin  by  multiplying  equation  (1)  by  m9m~ 1  and  reordering  the  derivatives  to  give 

^ :(9m )  +  V  •  ( 9mu )  =  urn  [V  •  (9m~LV9)  -  (m  -  l)9m~2\V  9\2]  .  (5) 

Now  we  take  the  volume  integral  over  the  domain,  defined  by 

(4)  =  [  4>  dV,  (6) 


1 


(7) 


where  cf>(x,t)  is  any  function,  which  gives 

Jt{ 9m )  =  —m(m  -  l)K(9m~2\V9\2), 

on  applying  the  divergence  theorem  to  the  second  term  on  the  left-hand  side  of  equation 
(5)  and  the  first  term  on  the  right-hand  side,  and  applying  boundary  conditions  (3)-(4)  to 
make  the  resulting  boundary  terms  vanish.  Setting  m  =  1,2  gives  the  two  equations 

>  =  <8> 

|(02)  =  -2K(|V0|2),  (9) 

respectively.  The  first  equation  above  represents  conservation  of  total  concentration,  which 
is  expected  given  that  no  concentration  enters  the  domain  as  ensured  by  boundary  conditions 
(3)-(4).  The  second  equation  is  an  evolution  equation  for  the  second  moment  of  9  in  terms 
of  its  gradient.  This  equation  is  not  closed,  i.e.  it  does  not  independently  determine  the  time 
evolution  of  (92),  because  its  right-hand  side  depends  on  |V0|,  whose  time  evolution  must 
be  determined  from  a  solution  of  the  advection-diffusion  equation  (1).  However,  noting  that 
the  right-hand  side  of  equation  (9)  is  negative-definite  if  9  is  not  constant  in  space,  we  can 
deduce  that  the  second  moment  of  the  concentration  must  decrease  in  time.  Thus,  at  large 
times  the  second  moment  converges  toward  a  constant  value  in  which  the  right-hand  side  of 
(9)  vanishes.  This  occurs  if  and  only  if  the  gradient  of  the  concentration  is  zero  everywhere. 
The  system  is  driven  toward  a  fully  homogenized  state  in  which  the  second  moment  takes 
the  value 

(92)  =  (10) 

Note  that  although  (92)  never  quite  reaches  this  value,  we  say  the  system  is  mixed  once  it 
has  fallen  below  a  certain  prescribed  threshold. 

As  mentioned  previously,  equation  (9)  does  not  independently  determine  the  rate  at 
which  (92)  decreases  in  time.  In  fact,  the  rate  at  which  (92)  decreases  usually  depends 
strongly  on  the  prescribed  velocity  field,  which  does  not  appear  in  equation  (9).  Importantly, 
the  introduction  of  a  velocity  field,  which  we  refer  to  as  stirring ,  usually  has  the  effect  of 
increasing  gradients  in  the  concentration  field  |V#|  through  advection  and  thus  has  the 
potential  to  significantly  accelerate  the  onset  of  mixing. 


filaments 


Figure  1:  Typical  mixing  scenario  in  which  a  patch  of  dye  is  advected  into  filaments. 
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Figure  2:  Decay  of  the  second  moment  of  the  concentration  field  under  the  action  of  stirring. 

A  typical  stirring  scenario  is  illustrated  in  figure  1,  in  which  an  initially  isolated  patch  of 
dye  is  advected  into  thin  striations  that  exhibit  a  much  greater  concentration  gradient  than 
the  initial  patch.  The  evolution  of  the  second  moment,  corresponding  to  this  typical  mixing 
scenario,  is  shown  in  figure  2.  During  the  initial  stirring  phase,  before  thin  filaments  are 
formed,  the  concentration  varies  over  relatively  large  length  scales  and  the  rate  of  mixing 
will  be  slow.  After  a  short  transient,  by  which  point  the  filaments  have  become  sufficiently 
thin,  the  rate  of  diffusion  is  large  enough  to  balance  the  rate  of  thinning  of  the  filaments 
and  the  rate  of  mixing  d(82)/dt  is  no  longer  characterized  by  n,  but  is  instead  characterized 
by  the  rate  at  which  the  velocity  field  is  straining  the  fluid  into  filaments.  Once  this  occurs 
the  characteristic  width  of  the  filaments  remains  constant,  and  the  rate  of  mixing  reaches 
a  maximum  which  persists  for  all  time.  The  independence  of  the  rate  of  diffusion  on  the 
parameter  n  during  the  mixing  phase  implies  the  scaling 

|V0|  ~  ft- 1/2 ,  (11) 

which  can  be  used  as  an  indication  of  good  mixing  in  practical  applications. 

The  coefficient  of  diffusion  k  is  small  in  most  applications.  Hence,  the  characteristic 
timescale  over  which  diffusion  takes  place  in  the  absence  of  an  advective  flow  (u  =  0) 
is  typically  very  long.  For  example,  in  the  context  of  heat  diffusing  through  air,  k  ~ 
2.2  x  10~5m2/s,  so  in  a  room  with  sides  of  length  L  =  10m,  a  characteristic  diffusion 
timescale  is  L2 / k  ~  53  days.  Thus,  if  one  is  aiming  to  mix  the  concentration  field,  relying 
on  diffusion  in  the  absence  of  advection  is  typically  ineffective.  In  the  following  section  we 
present  an  example  that  can  be  solved  analytically  to  demonstrate  the  development  of  the 
mixing  phase  discussed  above. 
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The  filament  solution 


Consider  the  prescribed  velocity  field 

u(x)  =  (Ax, -A  y), 


(12) 


where  A  is  a  constant  parameter,  which  drives  fluid  towards  the  x-axis  in  a  straining  motion 
with  a  hyperbolic  point  at  the  origin.  In  this  case  the  advection-diffusion  equation  takes 
the  form 

80  .89  80  2 

at  +  Xxai  ~  x%  =  kV  (13) 

This  equation  can  be  solved  exactly,  but  for  simplicity  here  we  consider  only  x-independent 
solutions  of  the  form 

0(y,  t)  =  e~xtf(y).  (14) 

Substituting  this  into  equation  (13)  gives  the  ordinary  differential  equation 

-A/  -  A yf  =  nf" .  (15) 


If  we  further  assume  that  the  concentration  vanishes  in  the  far  field,  /  — >  0  as  y  — >  ±oo,  we 
can  solve  equation  (15)  and  derive  the  solution 


e  =  e-xte-y2/212, 


(16) 


where 

i  =  (17) 

is  called  the  Batchelor  length. 

The  solution  (16)  represents  a  filament  that  is  an  attractor  for  typical  initial  conditions 
of  the  system.  For  any  reasonable  initial  concentration  field,  the  compression  due  to  the 
straining  velocity  will  attempt  to  confine  the  concentration  into  an  ever  thinner  region  about 
the  x-axis  until  its  gradient  becomes  sufficiently  large  for  diffusion  to  balance  the  rate  of 
compression.  The  width  of  the  resulting  filament  is  characterized  by  the  Batchelor  length 
(17),  which  contains  the  necessary  scaling  for  (11)  to  apply.  Furthermore,  we  see  from  the 
solution  (16)  that  the  rate  of  mixing  A  during  the  mixing  phase  is  indeed  independent  of  k. 
Note  that  equation  (8)  must  be  taken  with  a  grain  of  salt,  because  the  concentration  does 
not  vanish  as  |x|  — >  oo.  This  is  a  consequence  of  the  neglect  of  the  x  dependence. 


2  Effective  Diffusivity 

Recall  that  filaments  undergoing  advection  are  strained  along  one  direction  at  a  rate  A  and 
are  compressed  along  the  other  to  the  Batchelor  length  l  =  \J k/ A,  where  n  is  the  molecular 
diffusivity  and  A  is  the  local  rate  of  strain.  In  the  previous  section  it  was  shown  that  the 
variance  evolves  according  to 

(O'2)  ~  e~xt , 

where  A  is  a  local  rate  of  strain. 

From  simple  arguments,  molecular  diffusivity  alone  is  an  exceedingly  weak  method  for 
mixing  scalar  fields.  We  may  then  ask  how  stirring  might  improve  tracer  mixing  and,  more 
importantly,  how  we  can  quantify  the  effects  of  stirring  by  invoking  an  effective  diffusivity. 
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2.1  A  random  walk  model 

The  advection-diffusion  equation  for  a  passive  scalar  8  =  9(x,  t )  can  be  written  as 


—  +  u  ■  V9  =  k\729,  (18) 

ot 

where  u  is  a  given  velocity  field  and  k  is  a  constant  of  diffusivity.  Even  in  the  absence  of 
stirring  and  the  hlamentation  process,  this  diffusivity  can  act  to  mix  the  tracer  field  8  by 
monotonously  decreasing  the  variance  from  equation  (9). 

The  long  timescales  associated  with  diffusion  alone  indicates  that  the  physical  act  of 
stirring,  or  the  advective  term  u  ■  V 8  of  (18),  is  necessary  to  efficiently  mix  a  tracer  held. 
However,  the  advective  term  of  (18)  is  more  complex  than  the  diffusivity.  Ideally,  one 
could  write  the  effects  of  both  the  mixing  of  the  tracer  held  by  molecular  diffusivity  and 
by  advective  stirring  as  an  effective  diffusivity,  such  that  the  equations  of  motion  can  be 
approximated  by 

BO 

—  =  K\72e  -u-V8k  KeV28.  (19) 

We  consider  the  displacement,  x,  of  an  individual  particle  stirred  by  a  random  how  held. 
We  would  like  to  compare  the  displacemnt  of  the  particle  to  a  true  random  walk, 

Xn  —  Xn—  1  T  £n  (20) 

where  xn  is  position  of  the  particle  at  time  n  and  where  £ n  is  a  random  kick.  Here  we 
assume  that  all  £n  are  independent  and  identically  distributed  random  variables. 

At  the  nth  step,  the  particle  position  xn  satishes 

n 

xn  =  X0  + ^2  tk 

k= 1 

where  xq  is  the  initial  position.  If  the  initial  position  of  the  particle  is  xq  =  0  and  the  mean 
individual  kick  (£n)  =  0,  then  the  mean  position  of  the  particle  at  time  n  is 

n 

{ Xn )  =  ^(Zk)  =  0 
k=  1 

and  the  action  of  the  random  variations  in  the  how  held  given  by  £  do  not  change  the  mean 
position  of  the  particle. 

Consider  instead  the  mean-squared  displacement  of  the  particle: 

n 

{ xl )  =  ^2=  n(f)  =  n(j2’  (21) 

k= 1 

where  cr2  =  (£2).  The  mean-squared  displacement  of  the  particle  then  grows  linearly  with 
the  number  of  steps,  n,  at  a  rate  determined  by  the  standard  deviation  of  the  random 
forcing  held. 


5 


For  a  process  with  an  average  length  of  time  between  kicks,  T,  the  number  of  kicks  can 
be  converted  into  a  time  by  letting  t  =  nT.  The  squared  displacement  in  (21)  then  grows 
linearly  in  time  as 

(: x2(t ))  =  2  Kt  (22) 

where  k  =  a2  /  2 T  is  a  diffusivity,  or  a  measure  of  how  quickly  the  squared  displacement  of 
the  particle  grows  in  time. 

The  above  calculations  are  only  in  one  dimension.  In  d  dimensions,  the  mean-squared 
distance  from  the  origin  can  be  written  as  a  sum  of  the  mean-squared  distance  (xl  )  along 
each  dimension  k  at  timestep  n, 

d 

(^^xt,n)  =  (X1  ,n)  +  (x2,n)  +  •••  +  (xd,n) 
k= 1 

which,  assuming  the  random  forcing  is  isotropic  such  that  (x\  n)  =  (x2),  can  also  be  written 
as 


d(x2)  =  dna2  =  2  dnt  (23) 

where  k  =  a2 /2T  is  the  same  as  in  the  one  dimensional  case. 

The  diffusivity,  k,  can  be  thought  of  as  an  effective  diffusivity  as  it  arises  from  the 
physical  stirring  generated  by  the  stochastic  process  £n.  The  effective  diffusivity  is  again 
given  by 


(24) 


where  a2  is  the  standard  deviation  of  the  particle  displacement  due  to  the  random  kicks  £n 
and  T  is  the  average  interval  between  kicks. 


^eff 


a 

2 T 


2.2  Particle  concentration  and  diffusivity 

The  random  walk  description  of  a  diffusive  process  above  applies  only  to  a  single  particle 
with  position  given  by  xn.  We  wish  to  consider  the  evolution  of  a  cloud  of  particles.  Let 
6{x.  t )  be  the  density  of  particles  within  a  given  area  (or  volume  in  3D);  then  9  satisfies  the 
diffusive  equation 

=  Ke\720  (25) 

dt  y  ’ 

if  each  point  evolves  independently  according  to  a  random  walk  xn+\  =  xn  +  £n. 

The  idea  of  a  cloud  of  particles  only  works  if  the  spatial  and  temporal  resolution  of  the 
model  are  large  enough  such  that  the  motion  of  individual  particles  are  decorrelated.  If 
we  are  zoomed  in  too  closely,  such  that  the  grid  scale  is  less  than  the  standard  deviation 
of  the  displacements  ( L  <  a),  then  we  see  only  individual  particles  rather  than  a  cloud  of 
particles  with  a  given  concentration,  9.  If  we  choose  sampling  timescales  that  are  too  short, 
such  that  the  timescale  of  the  flow  is  less  than  the  interval  between  kicks,  T,  then  the  kicks 
become  correlated  in  time. 
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2.3  Numerical  examples  for  sine  flow 

Consider,  for  example,  the  advection  of  a  cloud  of  particles  by  the  famous  sine  flow  in 
a  doubly-periodic  domain  [2].  The  velocity  field  of  the  sine  flow  alternates  between  a 
horizontally-aligned  shear  flow  and  vertically-aligned  shear  flow  given  by 

uh  =  (U  sin  (27 xky/L)  ,0)  for  0  <  t  <  ^r; 
uy  =  (0  ,  U  sin  (2irkx/L))  for  \t  <t  <  t, 

where  here  U  is  a  constant  velocity,  k  is  the  wavenumber  of  the  shear  flow  field,  L  is  the 
size  of  the  biperiodic  domain,  and  r  is  the  period  of  the  flow. 

We  then  obtain  a  map  of  the  position  of  the  particle  by  solving 


X  =  U,  X  =  Xq 


exactly  as  a  two-step  process: 

Step  1  :  x(r/2 )  =  xq  +  U\  sin  ^  2n^yo  j 

y(r/2)  =  y0 


Step  2  :  x(r)  =  x{t / 2) 

y(T)  =  y[r/ 2)  +  U\  sin  (27r^r/2))  . 

At  the  end  of  an  interval  of  length  r,  the  new  position  (a of  a  particle  that  started  at 
(x,  y)  is  then 


x  =  x  +  T  sin 


27 rky\ 

~T~  ) 


y  =  y  +  T  sin 


2irkx'  \ 

) 


(26) 


where  T  =  Ut/2.  For  a  constant  velocity,  U,  T  is  proportional  to  r  and  the  intensity  of 
stirring  can  be  manipulated  by  varying  T.  Note  that  in  the  second  step  of  (26),  y' ,  is  a 
function  of  x'  and  not  x.  This  is  necessary  to  conserve  area,  as  can  be  verified  by  computing 
the  Jacobian  of  the  map  (26). 


2.4  Diffusivity  and  the  limit  of  small  T 

We  can  vary  the  period  of  the  sine  flow  given  in  (26)  by  varying  T.  For  small  T,  the 
period  of  the  stirring  is  less  than  the  timestep  of  integration  and  one  can  expect  the  flow  to 
appear  nondiffusive.  Indeed,  in  the  limit  of  small  T,  the  sine  map  given  in  (26)  approaches 
a  symplectic  integrator  [3].  This  result  can  be  obtained  by  noting  that 


x'  —  x 


T 


sin 


2irky\ 
— ) 


y  -y 


—  sm 


2itkx'  \ 

) 
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Figure  3:  Streamfunction  in  the  limit  that  T  — >•  0  .  The  flow  field  then  satisfies  u  = 
(dyip,  —dxip)  so  that  the  flow  is  clockwise  where  ip  is  negative  (dashed)  and  counterclockwise 
where  ip  is  positive  (solid). 


and  by  taking  the  limit  as  T  — >  0.  In  this  limit, 


^  etc.  such  that 


dx  (  2n rky\ 

—  =  sin  — - — 

dt  \  L  J 


dy  ( 2nkx 

—  =  sin  — - — 

dt  V  L 


(27) 


Equation  (27)  is  equivalent  to  saying  the  nondivergent  velocity  field,  u  =  dx/dt ,  can  be 
written  in  terms  of  a  streamfunction  u  =  (dyip,  —dxip)  where 


ip 


L 

2nk 


—  cos 


2irky\ 

L  ) 


(28) 


A  contour  plot  of  ip  is  given  in  figure  3. 

Consider  now  the  advection  of  a  cloud  of  particles  displaced  by  the  map  (26).  If  we  let 
L  =  k  =  1  and  use  a  relatively  small  value  of  T,  T  =  0.1,  the  flow  follows  nearly  closed 
orbits  as  given  in  figure  4.  As  this  flow  closely  approximates  the  streamfunction  solution, 
the  stirring  given  by  (26)  does  not  diffuse  the  concentration  of  particles.  As  a  result,  the 
use  of  an  ’effective’  diffusivity  does  not  apply  here. 

While  the  orbit  given  in  figure  4  is  not  strictly  confined  to  the  streamfunction  in  (28) 
because  of  the  use  of  a  finite  T,  it  is  not  space  filling.  We  can  add  noise  to  the  sinemap 
though 


x  =  x  +  T  sin 


+  \J2D  a 


y'  =  y  +  T  sin 


2tt  kx'\ 

) 


+  V2Db 


(29) 


where  a  and  b  are  Gaussian  random  variables  with  zero  mean  and  unit  variance.  Here  D  is  a 
constant  that  controls  the  amplitude  of  the  noise  and  can  be  consider  a  molecular  diffusivity. 
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itef  =  7,  T *  1  L=1  0  =  0 


(c)  D  =  0,  T  =  1,  and  it  =  7 


Figure  4:  Particle  concentration  in  a  box  of  sides  L  =  1  and  stirring  timestep  T  =  1  after 
20  iterations.  There  is  no  noise  in  (a)  and  the  map  is  confined  near  the  streamlines  given 
in  3.  With  the  addition  of  noise,  where  D  =  10-5,  the  orbit  becomes  diffusive.  Figure  4(c) 
is  the  same  as  in  4(a)  but  with  T  =  1  and  is  shown  only  at  the  7th  iteration. 


In  the  absence  of  stirring,  (29)  becomes  the  two-dimensional  version  of  the  random  walk 
model  given  in  (20).  As  we  have  chosen  (a2)  =  ( b 2)  =  1,  the  standard  deviation  a2  of  the 
noise  is  simply  2D. 

If  we  let  D  =  10~5  (figure  4(b)),  the  near-integrability  of  the  map  of  figure  4(a)  is 
broken,  however  the  flow  still  remains  constrained  to  near  the  streamlines  in  (28)  for  a 
moderate  number  of  iterations.  The  addition  of  this  molecular  diffusivity,  D ,  will  cause  the 
trajectory  to  eventually  fill  the  domain.  However,  this  process  is  quite  slow. 

The  diffusive  behavior  captured  by  adding  random  noise  in  figure  4(b)  can  also  be 
obtained  by  increasing  the  step  size  even  in  the  absence  of  noise.  For  a  box  size  L  =  1  and 
with  D  =  0  and  a  timestep  of  T  =  1,  ten  times  longer  than  that  of  the  examples  given 
in  figure  4,  the  flow  no  longer  follows  the  streamfunction  given  in  (28).  The  flow  in  figure 
4(c)  is  chaotic  and  quite  rapidly  fills  the  entire  domain  with  the  tracer  after  only  seven 
iterations. 

2.5  Filament  width  and  the  Batchelor  length 

The  Batchelor  length,  l  =  yj k/X ,  is  the  approximate  scale  at  which  filaments,  stretched 
by  stirring,  become  so  thin  that  the  tendency  for  the  filament  to  become  narrower  due  to 
stirring  is  balanced  by  the  diffusion  of  the  patch.  In  the  definition  of  the  Batchelor  length,  A 
is  the  local  rate  of  strain.  In  the  sine  map  given  in  (26),  modified  with  the  diffusion  equation 
of  (29),  the  rate  of  strain  is  approximately  T  and  the  coefficient  of  ”  molecular”  diffusion  is 
D.  For  this  problem,  then,  the  width  of  the  stirred  filaments  should  scale  approximately  as 

l  =  VlTT  (30) 

such  that  the  Batchelor  length  scales  with  \J~D  for  fixed  T . 

Snapshot  of  the  concentration  fields  for  ‘strong’  diffusivity,  with  D  =  10-4,  and  ‘weak’ 
diffusivity,  with  D  =  10~6,  are  given  in  5(a)  and  5(b),  respectively.  The  width  of  a  typical 
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0.5 


iter  =  3  T  =  1  L  =  1  D  :  0 0001 


iter  =  3.  T  =  1  L  =  1  D  =  1e-06 


Figure  5:  Particle  concentration  in  a  box  of  sides  L  =  1  with  T  =  1  and  with  varying  D  at 
iteration  3.  The  width  of  the  filaments  scale  as  the  Batchelor  length  given  in  (30). 


filament  in  the  ‘strong’  diffusivity  case  is  0.109  nondimensional  units  while  a  ‘weakly’  diffu¬ 
sive  filament  is  approximately  0.006  nondimensional  units.  The  width  of  the  filaments  for 
these  two  cases  do  roughly  scale  with  y/D  as  suggested  from  (30). 

2.6  Effective  diffusivity  and  the  domain  size,  L 

For  the  effective  diffusivity  scaling  given  in  (25)  to  hold  for  a  stirred  fluid,  the  length  scale 
of  the  stirring  must  be  less  than  that  of  the  domain.  In  the  examples  above,  the  length  scale 
of  the  stirring  was  the  same  as  that  of  the  domain.  If  we  increase  the  size  of  the  domain 
to  L  >  1  while  holding  T  at  an  intermediate  value,  T  =  1/2,  the  concept  of  an  ‘effective’ 
diffusivity  becomes  relevant. 

Shown  in  figure  6  are  snapshots  of  the  particle  concentration  field  after  ten  iterations  for 
a  domain  size  varying  from  L  =  1  to  L  =  25  while  the  spatial  correlations  ( x 2),  ( y 2),  and 
( X 2)  =  ( x 2  +  y 2)  and  cross-correlations  ( xy )  are  shown  in  figure  7.  For  small  domain  sizes, 
such  as  in  6(a)  and  6(b),  the  sine  flow  stirring  efficiently  fills  the  domain  with  the  tracer. 
This  homogenization  of  the  tracer  field  is  evident  in  6(c)  and  6(d)  by  the  leveling  off  of  the 
correlation  values  with  time. 

In  cases  with  a  relatively  small  domain  size,  such  as  6(a)  and  6(b),  the  spatial  variances 
for  ( x 2),  (y2,  and  ( X 2)  do  not  grow  linearly  with  time  as  predicted  from  the  effective 
diffusivity  equations  in  one  dimension,  equations  (22)  and  (23).  The  concept  of  an  effective 
diffusivity  fails  in  cases  where  the  stirring  is  at  approximately  the  same  scale  as  the  domain, 
as  predicted. 

Increasing  the  domain  size  to  L  =  10  and  L  =  25  while  holding  the  scale  of  the  stirring 
fixed,  however,  shows  a  more  promising  diffusive-like  behavior.  The  tracer  concentration  in 
6(c)  and  6(d)  spreads  outward  from  an  initial  patch  due  to  the  stirring  with  spatial  variances 
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Iter :  10  T  :  O  S  1=1  D  =  1e-06 


iter :  10  T  =  0.5  L  =  3  D  =  1e-06 


(c)  L  =  10 


(d)  L  =  25 


Figure  6:  Particle  concentration  in  a  box  of  of  domain  size  of  L  =  1  in  6(a),  L  =  3  in  6(b), 
L  =  10  in  6(c),  and  L  =  25  in  6(d)  with  T  =  1  and  with  D  =  lx  10-6  at  iteration  10. 


growing  linearly  with  time,  as  shown  in  figures  7(c)  and  7(d). 

Note  that  the  variances  in  the  x  and  y  directions  for  all  domain  scales  indicate  isotropy,  as 
shown  by  the  similar  behavior  of  ( x 2)  and  ( y 2)  in  figure  6.  This  agrees  with  the  assumption 
that  the  variances  in  multiple  dimensions  have  the  same  statistics/standard  deviation  as  we 
assumed  in  the  derivation  of  the  multi-dimensional  form  of  the  effective  diffusivity  equation, 
(23).  Additionally,  the  cross-correlation  between  the  x  and  y  spatial  dimensions,  (xy),  is 
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Figure  7:  Calculations  of  the  spatial  variance  for  the  examples  shown  in  6  as  a  function  of 
iteration  and  increasing  domain  size. 


zero  at  long  times. 

From  (23),  we  calculate  the  approximate  effective  diffusivity  for  the  L  =  25  example 
as  At  =  0.068,  four  orders  of  magnitude  larger  than  the  molecular  diffusivity,  D  =  10~6  - 
the  stirring  in  this  case  more  efficiently  disperses  the  tracer  field  than  molecular  diffusivity 
alone. 


3  Matlab  Codes 

3.1  run.sinemap .m 


Listing  1:  run_sinemap.m 

function  [varargout]  =  run_ s inemap (T , L , D , Np , sqw) 

if  nargin  <  1,  T  =  1;  end 
if  nargin  <2,  L  =  1;  end 
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if  nargin  <  3 

7,  Diffusivity  (noise) 

D  =  0; 
end 

if  nargin  <  4 

7,  Points  initial  square 
Np  =  100; 
end 

Npx  =  Np ;  Npy  =  Np ;  Np  =  Npx*Npy; 

if  nargin  <  5 

7,  Size  of  initial  square 
sqw  =  .2; 
end 

f  =  @(X)  sinemap (X , T/L , 1/L) ; 

x  =  1 inspace (- sqw/2 , sqw/2 , Npx )’ ; 
y  =  linspace (-sqw/2 , sqw/2 , Npy )’ ; 

X  =  [repmat  (x  ,  [Npy  1])  kron (y , ones (Npx , 1) )] /L ; 

dX2  =  mean  (  sum  (mymod  (L*X  ,  L)  .  ~2 , 2)  )  ; 
dx2  =  mean (mymod (L*X  (  :  ,1)  ,L)  .~2)  ; 
dy2  =  mean (mymod (L*X ( :  ,2)  ,L)  .~2)  ; 

dxy  =  mean (mymod (L*X  (  : ,1) ,L) . *mymod (L*X ( : ,2) ,L)) ; 

it  =  0 ; 
figure  (1) 

plotmap  (L*X  ,  T  ,  L  ,  D  ,  it  )  7«  Stretch  output, 

while  1 

inp  =  input ( ’Press  a  key  (Q  to  quit)’, ’s’); 
if  str cmp ( lower ( inp ),  ’  q ’ )  ,  break;  end 
X  =  f  (X)  ;  it  =  it  +  1 ; 

dX2  =  [dX2;  mean  (  sum  (mymod  (L*X  ,  L)  .  ~2 , 2)  )  ]  ; 
dx2  =  [dx2;  mean (mymod (L*X (:,  1)  ,  L)  .  ~2) ]  ; 
dy2  =  [dy2;  mean (mymod (L*X ( : , 2) , L) . "2) ] ; 

dxy  =  [dxy;  mean (mymod (L*X (:, 1)  , L)  . *mymod (L*X (:, 2)  , L) )]  ; 

7,  Add  some  noise 

if  D  >  0 ,  X  =  X  +  sqrt (2*D) /L*randn ( size (X) ) ;  end 
figure  (1) 

plotmap  (L*X  ,  T  ,  L  ,  D  ,  it  )  7«  Stretch  output, 

end 

hold  off 

if  nargout  >  0 

varargout{l}  =  dX2 ; 
figure  (2) 

plot  (0  :  it  ,  dX2  ,  ’k  .  -  ’  ) 
hold  on 

plot (0: it ,dx2 , ’ r . -- ’ ) 
plot (0: it ,dy2 , ’ b . -  -  ’  ) 
plot  (0:  it  ,  dxy  ,  ’ g . -  -  ’  ) 
hold  off 

xlabel ( 5  iteration  ’ ) 
ylabel ( ’ <X^2>  ’  ) 

legend(,<X'‘2>,  ,  5<x"2>5  ,  ’<y~2>’  ,  ’<xy>’  ,  ’Location’  ,  ’Northwest  ’) 
end 


•/#  - 

function  Xm  =  mymod(X,L) 

Xm  =  mod ( X  +  L/2,L)  -  L/2; 

•/#  - 

function  plotmap (X , T , L , D  ,  it  ) 

plot (mymod (X ( :  ,1)  , L)  , mymod (X(:  ,2)  ,L)  ,  ’  .  ’  ,  ’MarkerSize ’  ,1) 
axis ( [-L/2  L/2  -L/2  L/2]) 
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axis  square 

t  itle  (  sprintf  (  5  iter  =  °/0d  ,  T  =  #/0g  L  =  °/«g  D  =  °/«g  ’  ,  it  ,  T  ,  L  ,  D )  ) 
if  L  >  1 

gr  =  ( 0 : L - 1 )  -  f loor (L/2-.5)  -  .5; 
set (gca  , ’ XTick  ’ ,gr) 
set (gca  , * YTick * ,gr) 
set ( gca  ,  ’XTickLabel’  ,  []) 
set ( gca  , * YTickLabel J , []) 
grid  on 
end 


sinemap . 

in 

Listing  2 

:  sinemap. m 

fun 

ct  ion 

Xp  = 

sinemap (X , T , L) 

if 

nargin 

<  2. 

,  T  =  1;  end 

if 

nargin 

<  3. 

,  L  =  1 ;  end 

Xp 

=  zero 

s  ( s  ii 

se  (X)  )  ; 

Xp( 

:  ,1)  = 

X  (  :  , 

,1)  +  T*  sin  (2*pi*X  (  :  ,! 

2)/L) ; 

Xp( 

:  ,2)  = 

X  ( :  , 

,2)  +  T* sin (2*pi *Xp  (  : 

,1)/L)  ; 
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Introduction  to  Biological  Models 


Lecturer:  Glenn  Flierl 
May  11,  2011 


1  Introduction 

The  ocean  is  inhabited  by  innumerable  individuals  of  many  different  genera  and  species, 
each  having  its  own  developmental/physiological  state  and  each  being  immersed  in  its  own 
environment.  The  organisms  move,  both  because  of  water  flow  and  because  of  their  own 
swimming  or  buoyancy,  and  interact  with  their  environment  by  gathering  resources  which 
they  need  and  by  excreting  waste  products.  The  assimilated  material  can  be  used  for 
maintenance,  growth,  or  reproduction.  Finally,  the  organisms  can  die  either  from  natural 
causes  or  because  of  attacks  by  another  organism. 

Furthermore,  the  processes  just  described  must  generally  be  regarded  as  stochastic.  For 
example,  the  probability  of  a  predator  capturing  a  prey  item  will  depend  on  multiple  factors, 
each  with  its  own  probability:  finding  a  prey  item  in  range,  the  choice  to  attack,  success 
in  the  attack,  competition  against  others.  Such  a  description  suggests  an  “agent-based”  or 
“individual-based”  model  (IBM),  with  each  agent  carrying  information  about  its  position, 
its  species,  its  physiological  state,  etc.  Individuals  can  grow,  reproduce,  and  die.  Certainly, 
we  can  build  small  versions  of  such  models,  but  the  number  of  individuals  is  necessarily 
limited  (compared  to  nearly  20,000  copepods  per  cubic  meter  or  to  phytoplankton  densities 
on  the  order  of  108  per  cubic  meter).  However,  such  experiments  may  indeed  give  insight 
into  the  way  in  which  the  local,  stochastic  interactions  translate  into  terms  representing, 
for  example,  grazing  rates  in  terms  of  average  densities. 

Or  we  could  take  the  alternative  view  of  attempting  to  predict  the  probability  distribu¬ 
tion  for  biomass  in  a  continuous  space,  in  this  case  using  something  like  weight  and  species 
as  our  variables.  The  latter  is,  of  course,  discrete,  yet  different  organisms  can  be  genetically 
or,  more  importantly,  functionally  quite  close  to  others.  If  we  choose  a  species  ordering  such 
that  the  maximum  growth  rate  varies  smoothly,  we  may  expect  that  other  terms  entering 
the  dynamics  such  as  the  losses  by  predation  will  also  fall  on  a  fairly  smooth  curve.  Cer¬ 
tainly  on  any  diagram  such  as  figure  (1)  the  gaps  will  be  so  small  as  to  be  negligible,  and 
viewing  the  ordinate  as  a  continuous  variable  is  not  unreasonable.  We  can  then  consider 
the  ways  that  the  processes  described  above  alter  the  biomass  distribution  in  this  space. 

As  in  the  individual-based  model,  the  number  of  variables  we  would  have  to  consider  is 
still  unmanageably  large.  Furthermore,  for  each  (w,  sp ),  we  need  to  specify  the  sources,  the 
sink,  and  the  transfer  rates,  including  possible  nonlinear  dependence  on  the  local  biomass 
6(tc,-s|x,  t)  and  the  density  at  the  source/sink  b(w,s\x,  t). 

Any  attempt  to  construct  such  a  model  or  the  IBM  model,  will  inevitably  point  out 
how  little  we  know  about  most  of  the  species  inhabiting  the  ocean.  We  retreat  to  dealing 
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Figure  1:  The  range  of  sizes  for  various  species 


with  variables  such  as  P  which  can  be  viewed  as  the  integral  over  some  range  (si—  >  s2) 
of  species  and  over  weight.  We  then  presume  that  transfers  into  and  out  of  the  resulting 
black-boxes  can  be  represented  as  functions  just  of  the  integrated  values  and  attempt  to 
parameterize  those. 


2  Basic  biological  models 

Exponential  growth 


Suppose  each  individual  (or  pair)  reproduces  in  a  time  order  r,  then  the  time  evolution 
of  the  population  would  be  described  as 


b(t.  +  St)  =  [1  +  gSt]b(t.),  g  =  ln2/r 
^ =  gb  =>-  b(t)  =  bQexp(gt). 


(1) 

(2) 


where  b  is  the  population  and  g  is  its  growth  rate.  This  model  predicts  an  unrealistic  ex¬ 
ponential  growth  of  the  population  (see  figure  (2)).  Thus,  some  limiting  factors  should  exist. 


Resource  limitation 


For  example,  one  could  write  a  model  with  the  growth  rate  depending  on  a  resource  R 
which  itself  evolves  with  time: 


f  =  <»*-«>* 

r)  /? 

—  =  -gRb-X(R-R0) 


(3) 

(4) 
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Figure  2:  USA  population  as  a  function  of  time  showing  exponential  growth  at  initial  stage, 
but  a  decay  of  a  growth  rate  at  later  stages 

Here,  the  population  evolves  with  a  growth  rate  gR  and  a  death  rate  d.  The  resource  de¬ 
creases  due  to  consumption  by  population  and  is  restored  to  R0  with  a  time  scale  1/A.  This 
model  allows  for  a  steady  state  solution  with  R  =  d/g,b  =  —\{d/g  —  R0)/d.  However,  the 
growth  rate  is  unconstrained  with  increase  of  the  resource;  a  more  realistic  growth  model 
would  have  to  limit  it  at  R  — >  oo.  One  possibility  is  to  use  the  form  g^j:- 

Predator-pray 

Assume  that  zooplankton  (Z)  grows  by  consuming  phytoplankton  (P)  and  phytoplank¬ 
ton  grows  due  to  source  of  nutrients  (N)  assumed  to  be  fixed;  both  have  natural  death  rates 
dp,  dz 

gPZ  -  gNP  -  dPP  (5) 

agPZ  —  dzZ  (6) 

This  model  (the  Lotka-Volterra  equations)  gives  a  solution  with  cycles  that  have  a  con¬ 
served  quantity  H  =  agP  —  dzlnP  +  pNZ  +  dplnZ.  The  quadratic  NPZ  model  assumes 
N  =  Np  —  P  —  Z,  where  Np  is  a  total  amount  of  nutrients.  This  model  has  three  steady 
states  if  Np  >  (dz/otg)  +  ( dp//j, ),  with  a  P,  Z  non-zero  point  being  a  stable  attractor. 

NPZD 

The  NPZ  model  assumes  all  dead  organisms  or  excreted  material  is  immediately  rem¬ 
ineralized  to  usable  nutrient.  In  contrast,  the  NPZD  model  assumes  that  dead  organisms 
and  unassimilated  phytoplankton  would  contribute  to  a  detrital  pool  that  eventually  be- 


dP 

~dt 

dZ_ 

~dt 
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Figure  3:  Schematic  representation  of  NPZD  model  showing  the  fluxes  of  biomass  and  the 
parameters  controling  them 


comes  a  source  of  nutrients.  A  schematic  representation  of  this  model  is  shown  in  figure  (3), 
and  a  typical  version  of  the  corresponding  equations  (including  sinking  of  detritus)  would 
be: 

W,p  ~  '■VT^F-!'pTfeZ-‘ipF  +  v'(KVF)  <7) 

JjiZ  =  09 p  +  kZZ  ~dzZ  +  V'  (8) 

prD  +  -j-wsD  =  (l-a)g  P  Z  +  dPP  +  dzZ  -  rD  +  V  •  (acV D)  (9) 

Dt  oz  F  +  kz 

WtN  =  rD-^p+v'(*™l  (“) 


where  ^  +  u  •  V  denotes  the  material  derivative  of  the  advected  scalar. 


In  general  case  the  system  equations  have  the  form 


-bi  +  V  •  (u ibi)  =  Bi(x,t |b)  +  V  •  ((k  +  Ki)Vbi) 


Here,  u,;  and  Kt  represent  the  motion  relative  to  the  fluid.  The  function  Hj(x,  t|b)  represents 
interactions  between  different  parts  of  the  system.  Some  times  a  condition  ^  £>,;(x,  i|b)  =  0 
is  used,  which  assumes  that  the  total  biomass  is  conserved;  this  reduces  the  order  of  the 
system. 

Such  biological  models  require  a  large  number  of  parameters.  For  example,  in  the 
NPZD  model  with  4  variables,  the  parameters  are:  fi.  kp,  kz,  g,dp,dz,a,r  in  addition  to  the 
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biological  movement  term  ws  and  physical  parameters  such  as  n;  together,  they  determine 
the  fundamental  behavior  of  the  system.  Thus,  in  addition  to  the  mathematical  question 
of  the  behavior  of  the  equations  for  different  parameter  values,  modellers  need  to  decide 
on  numbers  which  seem  to  give  “realistic”  solutions.  Since  the  variables  are  not  generally 
directly  observed,  comparisons  with  data  tend  to  be  pretty  qualitative  in  an  case. 

Controlled  (i.e.,  laboratory)  experiments  can  be  a  guid  for  parameter  choicese;  however 
the  behavior  of  the  organisms  might  be  affected  by  the  laboratory  constraints  so  that  things 
like  predation  rates  may  be  quite  different.  Another  way  is  to  observe  the  biosystem  in  the 
nature,  but  in  reality  the  system  is  much  more  complicated  than  the  model,  and  this  can 
also  introduce  large  uncertainties. 

3  Stochastic  Dynamics 

An  alternative  method  for  modeling  organism’s  population  dynamics  is  through  the  use 
of  probability  density  functions  for  the  population  size.  The  probability  density  functions 
will  be  written  as  V(n,t)  which  represents  the  probability  of  having  a  population  of  size 
n  at  time  t.  In  case  of  multiple  interacting  populations,  the  evolution  of  the  population 
probability  is  based  on  the  equation 

V(z,  t  +  St)  =  Vt{Sz |z  —  Sz)V{z  —  Sz,  t )  (13) 

where  z  is  a  vector  representing  the  population  of  the  various  species  considered  in  the 
model  and  Vt  is  the  transition  probability.  This  equation  is  essentially  an  application  of 
the  law  of  total  probability.  The  probability  of  a  specific  population  at  a  future  time  step 
is  the  sum  of  all  possible  previous  populations  times  the  probability  of  transitioning  to 
the  specified  population  during  the  given  time  step.  For  one  of  the  species,  the  transition 
probability  can  be  modeled  as 

{ndn5t  Sn  =  —  1 

1  ~  ngnSt  —  ndnSt  Sn  =  0 
ngnSt  Sn  =  1 

where  gn  and  dn  are  the  growth  and  death  rates  per  capita  depending  on  the  population 
size  n. 

To  make  this  more  tractable,  let  us  consider  a  case  where  only  one  population  is  modeled. 
For  simplification  we  will  change  the  notation  from  V(n ,  t)  to  be  Vn(t).  Equation  13  becomes 

Vn{t  +  St)  =  Vn-i{t)(n  -  l)gn-\St  +  Vn(l  -  ngnSt  -  ndnSt)  +  Vn+i (n  +  l)dn+iSt 
^nly  +8t>  'Pn  =  'Pn-iitf^n  —  l)g„.-i  —  Vnn(gn  +  dn)  +  Vn+\(n  +  l)dn+\ 

=  Vn-i(n-l)gn-i-Vnnl{gn  + dn) +  Vn+i{n  +  l)dn+i 

O') 

From  this  the  first  few  equations  can  be  calculated  to  give  a  foundation  for  insight 

&Vo  =  Vxdx 

=  -Pi  (91  +di)  +  2V2d2  . 

iV 2  =  Vigi~2r2(g2  +  d2)  +  3V3d3  1  j 

^ \V3  =  2T)2<?2  —  3V3(g3  +  d3)  +  AV3d4 
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Now  we  will  look  at  the  expectation  of  certain  properties  and  will  define  the  functional  form 


^  fm  ^ ^  '  fmVnV 
n 


Clearly  the  sum  of  the  probabilities  at  a  given  time  is  constant  so 


—  <  1  >=  0. 
dt 


A  more  relevant  calculation  is  the  understanding  of  the  time  rate  of  change  of  the  expected 
population  change  which  can  be  shown  to  be 


d  , 

—  <n  >=<  n(gn 


dn)  >  • 


If  however,  the  death  and  growth  rates  are  independent  of  the  population  size  than  the 
constant  can  be  pulled  outside  of  the  expectation  calculation,  so 


dt<n>=  (9" 


dn)  <n> 


which  has  a  similar  form  to  the  deterministic  population  model.  Additionally  it  can  be 
shown  that  given 


d_ 

dt 


<  n2  >=  2  <  n2(gn 


dn)  >  +  <  n(gn  +  dn)  > 


and  a2  =<  n2  >  —  <  n  >2,  it  is  possible  to  calculate  the  long  term  trend  of  the  ratio 
between  the  standard  deviation  of  the  population  (<t)  and  the  expected  population  size 
with  growth  and  death  rates  independent  of  population  size 


lim  - 

t-yo o  <n> 


N 


o"o  +  <n>  o 

<  n  >o 


g+d 

g-d 


where  do  and  <  n  > o  are  the  initial  standard  deviations  and  the  expected  population  size. 
If  we  compare  a  set  of  realizations,  we  would  find  the  range  of  variation  increases  with  time 
just  as  the  mean  size  is  increasing. 

For  populations  which  can  double  in  a  day,  the  exponential  model  cannot  apply  for  long. 
We  can  examine  the  logistic  model  with  gn  =  go(l  —  n/no),  in  which  case  the  distribution 
can  reach  a  nearly  state.  It  cannot  be  completely  steady,  since  a  constant  value  of  V\  implies 
the  extinction  probability,  Vo,  is  continuing  to  grow. 


3.1  Fokker-Planck  Expansion 

when  the  number  of  individuals  is  very  large,  V(n ,  t )  can  be  treated  as  a  smooth  function  of 
n,  regarded  as  a  real  number.  If  we  start  again  with  the  Equation  13,  the  following  Taylor 
expansion  can  be  performed  to  approximate  the  dependence  on  the  vector  of  population 
sizes 

V(z,t)  +  5t^V(z,t)  =  J]^,T(5z|z,t)'P(z,t)  -  ^:J2SziVT(Sz\z,t)V(z,t) 

+  ldS^^5zi6zjVT(6z\ Z,t)V{z,t) 
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Figure  4:  Plot  of  the  U  and  K  for  a  linearly  deacreasing  growth  rate  and  a  constant  death 
rate  and  the  resulting  population  density  distribution. 


where  V(z.  t )  is  still  the  population  probability  function  dependent  on  the  coordinate  z  and 
time,  t ,  and  all  Ts  represent  a  differential  step  in  the  provided  coordinate.  This  Taylor 
expansion  can  be  simplified  to 


d 2 

dz,dz-, KtjV 


(16) 


where  the  substitutions 

Ui(z,t)  =  Y  Ki:j(z,t)  =  Y  ~T^L'Pt^ z|zT) 

have  been  made.  For  the  case  like  14  considering  birth  and  death  rates  in  one  species, 
U  =  n(gn  —  dn)  and  K  =  \n(gn  +  dn).  An  example  growth  rate  that  can  be  used  is 
9n  =  5o(l  —  n/no)  which  corresponds  to  a  linearly  decreasing  growth  rate  for  populations 
under  a  maximum  population,  uq.  With  a  death  rate  at  a  fixed  constant  the  function 
of  U  and  K  can  be  determined  and  are  presented  in  Figure  4.  The  resulting  population 
probability  distribution  function  is  a  narrow  Gaussian  centered  at  gn  =  dn. 


21 


Lecture  3:  Stirring  by  swimming  organisms 


Jean-Luc  Thiffeault 

Notes  by  Woosak  Moon  and  Renske  Gclderloos 
23  June  2010 


1  Introduction 

The  question  of  which  factors  contribute  to  mixing  in  the  ocean  has  been  the  subject  of 
many  studies.  The  idea  that  organisms  could  play  a  role  in  affecting  their  environment 
through  something  like  mixing  first  appeared  in  Darwin’s  last  book,  called  “The  formation 
of  vegetable  mould  through  the  action  of  earth  worms,  with  observations  on  their  habits.” 
(See  figure  1.)  This  was  suggested  by  his  uncle  and  future  father-in-law  Josiah  Wedgwood 
II,  son  of  the  famous  potter.  From  Darwin’s  book: 

In  the  year  1837,  a  short  paper  was  read  by  me  before  the  Geological  Society 
of  London,  “On  the  Formation  of  Mould,”  in  which  it  was  shown  that  small 
fragments  of  burnt  marl,  cinders  &c  which  had  been  thickly  strewed  over  the 
surface  of  several  meadows,  were  found  after  a  few  years  lying  at  the  depth  of 
some  inches  beneath  the  turf,  but  still  forming  a  layer.  This  apparent  sinking  of 
superficial  bodies  is  due,  as  was  first  suggested  to  me  by  Mr.  Wedgwood  of  Maer 
Hall  in  Staffordshire,  to  the  large  quantity  of  fine  earth  continually  brought  up 
to  the  surface  by  worms  in  the  form  of  castings.  These  castings  are  sooner  or 
later  spread  out  and  cover  up  any  object  left  on  the  surface.  I  was  thus  led  to 
conclude  that  all  the  vegetable  mould  over  the  whole  country  has  passed  many 
times  through,  and  will  again  pass  many  times  through,  the  intestinal  canals  of 
worms,  [emphasis  added]  Hence  the  term  “animal  mould”  would  be  in  some 
more  appropriate  than  that  used  of  “vegetable  mould.” 

The  modern  name  for  this  phenomenon  is  ‘bioturbation.’  At  the  time,  the  idea  that  many 
small  events  could  accumulate  to  create  something  much  bigger  was  quite  controversial  (and 
still  is  to  some  people,  in  the  case  of  evolution). 

It  was  not  until  almost  a  century  later  that  Walter  Munk  [8]  seriously  proposed  marine 
organisms  as  a  possible  factor  for  mixing  in  the  ocean.  However,  Munk  discounted  the 
effect  as  negligible.  The  idea  lay  dormant  for  forty  years,  until  2004  when  Huntley  & 
Zhou  [3]  analyzed  the  swimming  of  100  species  in  a  wide  range  of  sizes,  and  concluded 
that  for  11  representative  species  the  turbulent  energy  production  is  10  5  W  kg"1,  the 
total  of  which  would  add  up  to  an  amount  comparable  to  the  energy  dissipation  by  major 
storms.  Another  study  on  the  magnitude  of  the  mixing  effect  of  the  oceanic  biosphere  was 
performed  by  Dewar  et  al.  [2],  who  estimated  that  the  ocean’s  organisms  take  in  a  total 
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Figure  1:  Darwin’s  last  book,  “The  formation  of  vegetable  mould  through  the  action  of 
earth  worms,  with  observations  on  their  habits.” 


power  input  of  almost  63  TeraW.  They  assumed  that  about  1%  of  this  amount  is  delivered 
as  mechanical  energy,  which  leads  to  a  number  comparable  to  the  input  of  winds  and  tides 
[2].  An  experimental  study  by  Kunze  et  al.  [5]  showed  that  the  level  of  turbulent  activity  in 
an  inlet  was  elevated  by  2  to  3  orders  of  magnitude  during  the  day,  due  to  swimming  krill. 
(Though  it  must  be  mentioned  that  later  studies  were  far  less  successful.) 

In  reaction  to  the  papers  mentioned  above,  Visser  [10]  argues  that  very  small  organisms 
such  as  zooplankton  cannot  produce  sufficient  mixing  to  overturn  a  stratified  medium.  The 
author  points  out  that  the  efficiency  of  mixing  is  proportional  to  the  ratio  of  the  lengthscale 
of  the  organism  (L)  and  a  buoyancy  lengthscale,  defined  as  B  =  ( e/N 3)1^2,  where  e  is  the 
rate  of  turbulent  energy  dissipation  and  N  is  the  buoyancy  frequency.  When  L  <  B,  the 
mixing  efficiency  quickly  drops  by  several  orders  of  magnitude  [10],  rendering  biomixing 
negligible.  Katija  and  Dabiri  [4]  however  suggest  that  it  is  not  the  scale  of  turbulence 
produced  by  the  marine  organisms  that  is  the  relevant  quantity,  but  the  net  displacement 
of  fluid  particles  they  cause.  This  aspect  will  be  discussed  in  these  notes  by  constructing  a 
simple  model  of  stirring  by  swimming  organisms,  based  on  drift  trajectories  due  to  a  moving 
object. 

2  Displacement  of  a  fluid  particle  due  to  a  moving  body 

We  will  first  discuss  the  displacement  of  a  fluid  particle  due  to  a  single  moving  body,  and 
generalize  this  theory  in  the  next  section  to  multiple  ’kicks’  due  to  several  moving  bodies, 
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and  ultimately  to  an  effective  diffusivity.  For  simplicity  we  will  assume  the  moving  body 
to  be  2D  and  symmetric  with  respect  to  its  direction  of  motion.  (Axisymmetric  about  its 
direction  of  motion  in  3D.)  (See  figure  2.)  We  only  consider  potential  flow.  The  body 
moves  along  a  straight  line  with  constant  velocity  U  for  distance  A.  The  distance  between 
the  fluid  particle  and  the  straight  line  of  motion  is  a,  while  the  distance  from  the  start  of 
the  trajectory  to  the  point  of  closest  approach  (with  respect  to  the  initial  position  of  the 
particle)  is  b.  Together  a  and  b  are  called  impact  parameters.  Note  that  a  is  positive  but 
b  can  have  either  sign,  with  a  negative  b  indicating  a  swimmer  that  begins  its  trajectory 
already  having  passed  the  fluid  particle. 


Figure  2:  Displacement  of  a  fluid  particle  by  a  swimming  body. 


Due  to  the  movement  of  the  swimming  body,  the  fluid  particle  will  have  a  net  displace¬ 
ment  A  =  A\(a,b),  known  as  drift.  Although  Maxwell  [7]  was  the  first  to  discuss  the  drift 
to  a  moving  body,  it  is  often  referred  as  ’Darwinian  drift’  [1],  due  to  Charles  G.  Darwin, 
great  grandson  of  the  other  Darwin. 

We  first  need  to  determine  how  much  a  fluid  particle  is  displaced.  We  could  solve  this 
problem  in  a  fixed  frame  centered  around  the  moving  cylinder,  where  the  particle  moves 
around  the  body  (figure  3,  left  panel).  It  is  however  easier  to  work  in  a  frame  which  is 
moving  with  the  fluid  particle  (figure  3,  right  panel). 

In  the  x  direction,  the  displacement  of  the  particle  solely  due  the  flow  U  would  be  b  —  A 
(“free-streaming”).  Because  of  the  influence  of  the  moving  body,  the  fluid  particle  ends  up 
at  X{,  a  distance  Ax  away  from  b  —  A.  Now  we  can  define  a  travel  time  T  it  takes  the  fluid 
particle  to  go  from  b  to  xf 


T  _*_r  * 

U  Jh  u(x,  y) 

where  u(x,  y )  is  the  velocity  felt  by  the  fluid  particle, 
component  of  the  velocity  field  never  vanishes.  Using  Xf 


(1) 


This  assumes  that  the  horizontal 
=  b  —  A  +  Ax,  this  gives 


a  _  rb~x+ Ax  dx 

U~  Jh  v 


c  dx 

M 

dx 


[b  — 
lb—  \+Ax  1^1 
rb-\ 


If  the  particle  is  only  lightly  displaced  and  \b  —  A|  is  large,  then  the  velocity  felt  by  the 
particle  approximates  the  free-stream  velocity  (|u|  ~  U )  in  the  second  integral.  The  second 
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FIXED  FRAME 


COMOVING  FRAME 


Figure  3:  Left:  displacement  of  the  particle  in  a  fixed  frame.  The  particles  starts  in  the 
origin.  Right:  displacement  of  the  particle  in  a  frame  that  moves  with  the  particle.  The 
moving  cylinder  is  in  the  origin. 


integral  can  then  be  approximated  by  A x/U,  which  gives 

A  fb  dx  Ax  fb  dx  A 

U  ~  Jb_x  \u\  U  Jb_x  \u\  U 


From  this  statement  we  can  derive  an  equation  for  the  approximate  horizontal  displacement 
of  the  fluid  particle  Ax: 


Ax  ~  U 


(2) 


The  displacement  in  the  y-direction  is  trivial,  as  the  fluid  particle  will  follow  a  streamline 
of  the  flow  in  the  comoving  frame  (assuming  the  flow  is  steady  in  that  frame).  Far  away 
from  the  moving  cylinder  (A  — >  oo,  b  —  A  -A  — oo)  the  displacement  in  the  y-direction  must 
therefore  be  zero: 


Ay  =  0 


(3) 


Note  that  both  boxed  expressions  are  only  valid  in  the  limit  of  large  path  length  A. 


2.1  Example:  A  cylinder  in  potential  flow 

As  an  example,  consider  a  cylinder  in  potential  flow  with  stream  function 

*(x,y)  =  -Uy(  1-^rV)  (4) 

V  £  +y  / 

with  l  the  radius  of  the  cylinder  and 

ST  ST 

U  =  Sy  ’  V  =  ~dx' 

We  will  choose  U  =  1  and  1  =  1. 
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2.1.1  Far  away  from  the  cylinder 


It  can  be  expected  that  the  effect  of  a  moving  cylinder  on  a  particle  far  away  from  the  body 
is  negligible.  To  test  this  statement,  one  can  construct  a  qualitative  order  of  magnitude  for 
the  displacement  of  such  a  particle.  This  displacement  can  be  represented  by  Ax  and  Ay. 
We  will  assume  the  path  length  is  infinite,  so  the  displacement  Ay  vanishes  identically. 

First  of  all,  let  us  calculate  the  maximum  excursion  in  y,  emax.  This  can  be  calculated 
using  the  stream  function  in  the  fixed  frame  (figure  2).  Here,  the  particle  is  located  initially 
at  (6,  a).  Because  the  flow  around  the  cylinder  is  steady,  the  particle  moves  along  the  stream 
line.  Therefore,  we  can  set  up 

y  a  . 

~y+^T, - 2  =  +  2  ,  U2  -  ~Q’  (5) 

xz  +  y  az  +  bz 

where  a  3>  1  is  assumed  such  that  can  be  neglected.  Let  y  =  a  +  e,  where  a  3>  e.  If 

we  input  this  expression  into  equation  5,  we  obtain 


—a  —  e  + 


a  +  e 


x2  +  (a  +  e)' 


~  —  a  —  e  + 


x2  +  a2 


~  —a . 


Therefore, 

Here,  emax  ~  maxx(e),  thus 


e  ~ 


x2  +  a 2 


0{l/a) 


(6) 

(7) 


where  we  assume  from  now  that  x  <C  a. 

Ax  can  be  calculated  from  equation  2.  First,  we  can  find  u  using  the  given  streamfunc- 
tion  in  the  fixed  frame  and  then  construct  an  approximate  expression  for  |rt|_1  —  1.  The 
velocity  u  is 

dT  x2  —  y2 

11  —  ,  . -  =  -  1  +  - Q  • 

9y  ( x 2  +  y2) 


Therefore,  using  i>i/>  1,  it  follows  that 
1 


-  1  = 


u 


1  - 


1  x2,  —  v2  x2  —  v2 

+  — — *U-i=  x  y 


x*—y* 

(x2-\-y2)2 


( X 2  +  y 


2\2 


(x2  +  y 


2^' 


Let  y  =  a  +  e,  this  gives 


-  1 


u 


x2  —  a2 

t2~\2 


2ae 


(: x 2  +  a2)  (x2  +  a2) 


2^2- 


Therefore, 


r°°  f  i 

J — oo  \  1^1 


Ax  =  /  —  1  )  dx  ~ 


[°°  x2  —  a2  ,  2  ae 

/  - 7T  dx  —  /  - o  dx. 

Loo  ( X 2  +  a2)2  Loo  (x2  +  a2)2 


Here,  the  first  integral  on  the  right  hand  side  is 


/ 


oo  2  _  2  Iff 

dx  =  — r  cos2dde  =  o, 


-oo  (x2  +  a2)^ 


a  J-l 
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where  x  =  atan0  is  used.  The  second  integral  on  the  right  hand  side  is 


2  ae 


L 


-oo  (x2  +  a2) 


2  ae 

2dx  =  ^r 


cos2  6d6  = 

a1 


Therefore, 


Ax  — 


(8) 


Previously,  we  showed  that  e  ~  0(a  x),  which  leads  to  the  conclusion  that  Ax  0(a~ 3). 
Because  a  is  much  larger  than  1 ,  the  displacement  of  the  particle  far  away  from  the  cylinder 
is  negligible,  which  confirms  our  initial  guess.  However,  note  that  the  perturbation  to  the 
velocity  field  due  to  the  moving  object  is  of  order  a-2,  and  the  net  displacement  is  smaller 
by  another  order  of  magnitude.  Thus,  drift  trajectories  far  from  the  moving  body  are  very 
nearly  closed. 


2.1.2  Close  to  the  cylinder 

A  more  interesting  limit  is  the  one  very  close  to  the  cylinder  (a  <C  1).  The  path  of  the  fluid 
particle  can  be  subdivided  into  three  regions  (figure  4).  In  the  first  region  the  particle  more 
or  less  follows  the  x-axis.  In  region  2  the  fluid  trajectory  bends  upwards  and  in  region  3  it 
follows  an  almost  circular  path  to  the  y-axis.  On  the  other  side  of  the  moving  body,  the 
fluid  particle  follows  the  same  path  in  reverse  order. 


Figure  4:  Path  of  a  fluid  particle  close  to  the  cylinder  ( a  -C  1). 


We  want  to  know  the  displacement  Ax  (equation  2).  With  U  =  1  the  integral  for  Ax 
becomes 

T =/ (;  +  1)'iI 


where  we  now  use  the  travel  time  T  instead  of  distance  Ax  to  match  the  units  (T  has  the 
same  numerical  value  as  Ax  for  U  =  1).  We  can  split  the  integral  into  the  three  regions. 
For  the  third  region  one  can  use  the  full  stream  function,  but  for  the  first  two  it  is  easier  to 
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(9a) 

(9b) 

(9c) 


Region  2 

For  region  2  we  first  need  to  specify  the  boundaries  of  the  region.  Let  us  rewrite  the 
approximate  streamfunction  as 

T2  —  -2  (x  -  1)  y  =  -2 XY 

where  X  =  x  —  1  and  Y  =  y.  The  fluid  particle  enters  region  2  at  (Xq.  Lo).  At  that  point 

=  —2XqYq  =  —  (l  —  b~2)  a . 


As  Xq  =  e  and  b  3>  1,  this  is  approximately 

-2elo  =  -a  Y0  =  . 

Because  the  stream  function  is  hyperbolic  and  the  domain  of  region  2  is  of  size  e  by  e,  it 
follows  that  (Xi,Yi)  =  (a/2e,  e).  The  travel  time  in  region  2  is  thus 


T2 


(i  + 1)  dx 


dx 


28 


Figure  5:  Close-up  of  the  path  of  the  fluid  particle  in  region  3. 


Region  3 

From  the  full  stream  function  the  velocity  u  is 


u(x,y) 


d T 

dy 


-1  + 


x 2  —  y2 
(x2  +  y2)' 


In  polar  coordinates  (x  =  r  cos  8,  y  =  rsin#)  this  can  be  written  as 

,  ,  cos  29 

u(r,9)  =  -l  +  —^~ 

Figure  5  shows  a  sketch  of  the  path  of  fluid  particle  in  region  3.  We  define  6\  as  the 
(small)  angle  between  the  x-axis  and  the  point  (1  +  X\ .  Y\).  The  travel  time  in  region  3  is 
then 


t3  = 


r/2 


'9 1 


u 


dx 


,  fn/2  cos  29 


sin0 


<w  =  -i  +  \  log  2  —  ^  log  6\  +  o  (e\) 


using  dx/d9  =  —  rsin0.  From  figure  5  one  can  derive 
,  Y1  e 

tan<'1  =  TTx~i  =  ITf  ~ 

The  travel  time  in  region  3  is  thus  approximately 


e(l - )  =  e  —  a 

e  - 


T3  ~  -1  +  ^log2  -  \  log +  0  (  ( - 


2e 


For  the  displacement  of  the  fluid  particle  we  need  the  total  travel  time,  which  is  twice 
the  sum  of  the  travel  time  in  region  1,2,  and  3.  Adding  up  the  leading  orders  of  the  travel 
times  gives 

T  =  Ti  +  T2  +  T3  =  (l  log  (?)  -  6-1)  +  (-1  log  (^))  +  (-1  +  1  log2  -  1  loge)  . 
The  e’s  cancel  out.  To  leading  order,  the  total  transit  time  for  half  the  particle  path  is  then 

T  =  —  \  logo  -  1  +  |  log 2  —  b~l  .  (10) 


For  small  a,  the  first  term  dominates  the  travel  time,  i.e.,  the  particle  tends  to  get  stuck 
near  the  stagnation  point  in  region  2  for  a  while.  As  a  — >  0,  we  see  that  T  — >  oo.  In  general, 
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for  a  moving  body  of  any  shape  in  2D  potential  flow  it  turns  out  that  the  coefficient  of  log  a 
is  given  by  the  sum  over  the  linearization  coefficients  for  each  hyperbolic  stagnation  point 
encountered,  as  long  as  a  is  small.  The  displacement  of  a  fluid  particle  close  to  a  moving 
body  in  potential  flow  due  to  this  moving  body  is  thus  well-approximated  by 


Aa  (a,  b) 


—  log  a  0  <  b  <  A 
0  otherwise. 


(11) 


The  cutoff  for  b  <  0  and  6  >  A  is  due  to  the  fact  that  since  the  displacement  is  mostly  due 
to  the  stagnation  points,  the  particle  must  at  least  reach  the  stagnation  point.  (We  take 
the  size  of  the  body  itself  as  negligible.) 

3  Displacement  of  a  fluid  particle  due  to  multiple  kicks 

In  previous  section  an  expression  for  Aa  (a,  b)  was  obtained.  Based  on  this  information  we 
can  derive  an  effective  diffusivity.  Here,  we  have  four  constants  and  two  random  variables: 
U  is  the  mean  velocity  of  the  cylinder,  l  is  the  length  scale,  A  is  the  mean  free  path  of 
the  cylinder,  and  n  the  number  density.  The  position  of  the  particle  is  coordinated  by  two 
random  variables,  a  and  b,  which  were  described  in  the  previous  section.  If  we  pick  a  point, 
(x,y),  in  Cartesian  coordinates,  the  probability  measure  of  the  point  is  simply  represented 
as  (1  /V)dxdy,  where  V  is  the  total  volume.  Instead  of  (x,  y)  space,  we  have  to  use  (a,  6) 
space  for  representing  the  PDF  of  the  particle  distribution.  Considering  that  a  is  positive 
and  neglecting  the  edge  of  the  domain,  we  find 

1  2 

—  dxdy  =  —  dadb .  (12) 

A  similar  relation  can  be  found  in  3D.  In  this  case,  the  probability  measure  can  be  expressed 

aS  12 

—  dxdydz  =  dadb .  (13) 

Now,  assume  that  our  target  particle  is  kicked  by  a  swimmer.  For  N  steps  or  kicks, 

N 

XN  =  x0  +  a  {ak,bk)rk,  (14) 

k=l 

where  ak,  b/~,  and  rp-  are  random  variables.  Their  choices  at  each  step  are  independent.  In 
particular,  (fk'^l)  =  Sp;p.  The  direction  of  a  swimmer  is  chosen  independently.  Without  loss 
of  generality,  we  can  assume  that  xq  =  0.  Since  (rk)  =  0,  it  follows  from  equation  (14)  that 
(xn)  =  0.  To  calculate  the  effective  diffusivity,  the  second  moment  must  be  considered: 

N 

(|*Ar|2)  =  £<A  («fc)  bk))  +  vanishing  cross  terms. 
k= l 

The  cross  terms  vanish  because  {ry.  ■  r{)  =  5ki ■  Each  step  of  the  target  particle  caused  by 
the  swimmers  is  independent  such  that  the  particle  movement  can  be  considered  to  be  a 
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random  walk.  Since  the  variables  are  identically  distributed,  we  get 


v— ^  N  f  r 

(ak,  bk)  hh)  =  N(AX  (a,  b))  =  —  /  /  A2  (<b  b)  2 dadb. 

k= 1 


N  is  introduced  as  the  number  of  time  steps.  Physically,  N  can  be  understood  as  the 
number  of  collisions  between  the  particle  and  the  swimmers.  It  can  be  represented  as  t/T, 
where  t  is  an  elapsed  time  and  T  is  the  mean  free  time,  the  average  time  for  a  swimmer  to 
collide  with  the  particle.  Hence,  t/T  is  the  average  number  of  collisions  during  the  time  t. 
Here,  T  can  be  represented  as  A /U,  where  A  is  the  mean  length  which  a  swimmer  proceeds 
before  changing  its  direction,  so  that  N  can  be  written  as  Ut/X.  Hence, 

(|aV)|2>  =  YV  JJ  (a'h^2dadb' 

where  x(t)  is  used  instead  of  xn-  To  eliminate  V  from  the  equation,  observe  that  the  above 
equation  is  for  one  swimmer,  so  n  =  1/V  is  the  number  density.  Therefore, 

( |m(t) | 2)  =  - -j—  J j  A2x(a,b)  dadb  =  Ant,  (15) 

where  k  is  the  effective  diffusivity.  Ant  comes  from  the  result  of  random  walk  (see  previous 
lecture).  The  above  expression  is  for  a  two-dimensional  system,  but  can  be  easily  generalized 
for  3D.  We  obtain  finally 


(16) 


Recall  our  approximate  form  for  displacement  due  to  a  cylinder,  including  dimensions: 


f  TX  I  Aa  (a>  b)  dadb, 

2D 

H\j  -  \ 

\  TUX  I  tX2x(a,b)  adadb, 

3  D 

(  —l\og(a/l)  0<b<\ 

\  negligible  otherwise  ’ 


the  calculation  of  k  becomes 

Un 

k  ~  — —  A  /  l2  log2  (a /l) da  . 

2A  J  o 

Using  fj  log2  xdx  =  2,  we  obtain 

k  ~  Uni3  .  (17) 

Note  that  this  result  is  completely  independent  of  A  (for  A  large). 

Another  relevant  case  is  a  swimmer  with  a  ‘bubble  wake,’  that  is,  a  region  enclosed 
by  streamlines  that  follows  the  swimmer,  often  called  the  atmosphere  of  the  swimmer.  If 
there  is  a  bubble  wake  behind  a  swimmer,  a  particle  inside  the  bubble  wake  will  follow  the 
swimmer  until  the  swimmer  changes  its  direction.  Therefore,  the  particle  trapped  inside 
the  bubble  moves  by  A.  Our  approximate  expression  of  Aa  (a,  b)  is 


f  A  0  <  b  <  A 
(  0  otherwise. 


(18) 
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Swimmer 

A-dependence 

far/near  field  dominance 

potential  (slip) 

none 

near 

viscous  (squirmer) 

none 

far 

viscous  (no-slip) 

log  A 

near 

trapped  bubble 

A 

near 

Table  1:  Dependence  of  displacement  on  path  length  and  whether  the  displacement  is 
dominated  by  far  or  near  field,  for  various  swimmer  models. 


In  this  case,  our  calculation  of  n  is 

6k  =  ‘—r—  [  \2dadb  =  UnAVbubbic  > 

^  J  bubble  interior 

where  Vbubbie  is  the  total  volume  of  the  bubble.  Therefore, 

k  =  gDnAVbubbie  •  (19) 

Now,  unlike  the  potential  flow  case,  the  effective  diffusivity  depends  explictly  on  path  length 
A.  This  k  can  be  much  larger  than  that  for  untrapped  fluid. 

More  complexity  could  come  in  if  we  consider  viscous  swimmers  with  a  boundary  layer. 
In  this  case,  some  simple  estimates  [9]  suggest  k  ~  log  A.  For  micro-organisms  in  the 
Stokes  flow  approximation,  the  transport  is  larger  and  is  dominated  by  far-field  (a  few 
body  lengths)  hydrodynamics  [6].  The  dependence  on  flow  characteristics  and  boundary 
conditions  is  summarized  in  Table  1,  but  it  should  be  noted  that  the  details  of  this  table 
are  still  the  object  of  active  research. 
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1  Introduction 

In  this  lecture  we  will  try  to  understand  local  or  Lagrangian  theories  involved  in  mixing 
of  passive  scalars.  This  involves  solving  the  advection-diffusion  (AD)  equation  along  fluid 
trajectories.  The  origins  of  these  local  theories  can  be  traced  to  Batchelor’s  idea  of  describing 
the  flow  via  spatially-constant  strain-rate  matrices  with  prescribed  time  dependence  [2]. 
Kraichnan  addressed  the  problem  next  by  considering  the  velocity  field  in  the  AD  equation 
to  be  a  stochastic  Gaussian  field  with  a  time  correlation  that  decays  infinitely  rapidly  (or 
is  white  in  time)  and  a  spatial  correlation  that  has  a  power-law  structure  [6].  This  led 
to  solving  a  stochastic  differential  equation.  Zeldovich  encountered  the  problem  in  the 
context  of  heat  diffusion  and  the  magnetic  dynamo  and  adopted  a  random  matrix  theory 
approach  [13-15].  More  recently,  tools  from  large  deviation  theory  and  path  integration 
have  aided  in  obtaining  a  complete  solution  of  the  problem,  as  will  be  discussed  in  this 
lecture. 

Let  us  revisit  the  AD  equation, 

dte  +  u  ■  VO  =  /tV2(9,  n  =  M2  or  M3.  (1) 

As  mentioned  before  6(x,t)  denotes  a  passive  scalar  field  (say  concentration  of  dye)  and  u(x,t ) 
is  the  ambient  velocity  field  which  stirs  the  scalar  as  it  diffuses  with  a  diffusivity  k. 

Previously  (Lecture  1)  we  have  examined  how  a  “patch”  of  dye  evolves  in  an  extensional 
flow,  u  =  ( Xx ,  —  Ay).  We  will  now  address  the  evolution  of  a  passive  scalar  when  acted  upon 
by  an  arbitrary  linear  flow.  By  definition,  for  a  linear  flow  the  velocity  field  depends  linearly 
on  the  position.  Rigid  body  rotation,  extensional  flow,  and  simple  shear  are  some  classic 
examples  of  linear  flow.  In  general, 

u(x,t)  =  U(t)  +  x  ■  A(t),  Tr(A)  =  0,  (2) 

where  A(t )  is  the  velocity  gradient  tensor.  The  requirement  for  it  to  be  traceless  is  a  conse¬ 
quence  of  incompressibility  of  the  flow  field  (V  •  u  =  0).  For  turbulent  or  Lagrangian-chaotic 
flows,  A(f)  is  a  random  matrix,  having  a  finite  correlation  time,  which  is  the  Lagrangian 
correlation  time  of  the  velocity.  Here  U ( t )  is  a  uniform  flow.  The  advantage  of  using  linear 
flows  lies  in  making  the  problem  analytically  tractable  and  most  importantly  it  serves  as  a 
nice  prototype  for  more  complicated  flows.  We  will  simplify  things  further  by  considering 
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(a)  (b) 


Figure  1:  Advection  of  a  concentration  field  by  a  simple  shear  flow,  u  =  (Ux,  Uy )  =  (7?/,  0) 
(neglecting  diffusion,  7  assumed  constant).  An  intially  isotropic  concentration  field  (a)  gets 
both  stretched  and  rotated  by  the  background  flow  (b). 

the  limit  of  high  Schmidt  number,  which  is  a  dimensionless  quantity  defined  as 

Sc  :=  v / k 

where  v  is  the  kinematic  viscosity  of  the  fluid  and  k  as  already  mentioned  is  the  diffusivity 
of  the  scalar.  (If  the  scalar  is  heat,  then  the  Schmidt  number  becomes  the  Prandtl  number.) 
Momentum  diffuses  significantly  faster  than  the  passive  scalar  for  flows  with  large  Sc.  This  is 
the  desired  limit  for  chaotic  advection:  Sc  3>  1  ensures  that  the  velocity  field  appears  much 
smoother  than  the  passive  scalar  concentration,  allowing  the  linear  local  approximation  (2). 
This  is  the  regime  that  was  studied  by  Batchelor  and  leads  to  the  celebrated  Batchelor 
spectrum.  For  Sc  <  1  we  have  the  Kolmogorov-Obukhov-Corrsin  regime,  where  the  scalar 
gets  advected  by  the  turbulent  inertial  range,  unlike  the  Batchelor  regime.  (We  will  not  be 
discussing  the  KOC  regime  in  this  lecture.) 

2  ‘Inertia  tensor’  of  a  patch  of  passive  scalar 

It  is  of  great  interest  to  understand  the  response  of  a  passive  scalar  to  a  flow  with  chaotic 
Lagrangian  trajectories.  As  already  discussed,  a  decent  approximation  to  such  a  velocity 
field  can  be  made  by  superposing  random  linear  flows  (in  the  limit  Sc  >  1).  In  order 
to  get  a  statistical  description  of  the  passive  scalar  held  subjected  to  such  an  ensemble  of 
linear  hows,  we  need  to  understand  the  behaviour  of  the  moments  of  9{x,t).  We  use  angle 
brackets  to  denote  the  volume-integrated  value  of  a  quantity  (similar  to  an  average), 

(/)=  [  fdV. 

J  Q 

We  have  already  seen  (Lecture  1)  that  when  the  domain  is  either  periodic  or  is  bounded  by 
insulating  walls  then 

dt(9)  =  0,  (3) 
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that  is,  the  mean  value  of  9  is  conserved.  This  is  a  consequence  of  the  no-net-flux  boundary 
condition  imposed  on  the  system.  To  track  the  evolution  of  ‘a  blob  of  dye,’  one  can  define 
the  ‘center  of  mass’  of  9  as 

_  (xtf) 

Ci~  (0) 

To  obtain  an  evolution  equation  for  c  we  consider  the  first  moment  of  the  AD  equation, 
and  integrate  by  parts  a  few  times: 


dt(xi0)  +  (XiV  •(([/  +  x  •  A)9))  =  K(XiV29 ), 

=>•  dt(xi9)  -  (( Uj  +  xiAej)9djXi )  =  0, 

=>-  dt{xi9)  -  Ui{9)  -  Agi(d)c£  =  0,  (djXi  =  (%) 


which  leads  to 


c  =  U  +  c  •  A. 


(5) 


Here  the  overdot  represents  a  derivative  with  respect  to  time.  In  the  calculation  the  contri¬ 
bution  due  to  the  dissipative  term  disappeared  due  to  the  no-net-flux  boundary  conditions. 
Thus  the  center  of  mass  of  the  patch  gets  advected  by  the  background  flow. 

Proceeding  to  the  calculation  for  the  next  moment  we  define  the  ‘inertia  tensor’  of  the 
patch  as 


Mij  = 


{ XiXj9 ) 

~~ w  CiCJ ' 


The  additional  —CiCj  term  makes  M  translationally  invariant.  By  construction  M  is  also 
a  symmetric  matrix.  We  proceed  in  similar  fashion  as  above  to  obtain  the  corresponding 
evolution  equation  for  M. 


dt(xiXj9 )  +  (xiXjV  ■  (( U  +  x  ■  A) 9))  =  n{xiXjW29) 

Oj  (xjXj  9  j  Uj(xi9)  Uj  ( x  j  9 )  Apj(xiXp9)  Apt  \  X-jXpO)  —  2KSpj  (9'j 

—t"  (h  T  CiCj )  Ujd  UiCj  -Apj  ( Mip  T  c} cp j  Api  (Adjp  T  CjCp )  —  2 KSij 

=t>  Oj  Mij  -)-  Cj  ( Of  Cj  Uj  CpApj)  -\-  Cj  (Otd  Ui  CpApi}  =  MipApj  T  ApiMjp  T  2  k5^ 


so  that  finally 

M  =  M  ■  A  +  A]  ■  M  +  2kI  .  (6) 

Thus  we  have  obtained  an  evolution  equation  for  the  ‘inertia  tensor’  of  the  patch.  Here  A t 
denotes  the  transpose  of  the  velocity  gradient  tensor. 

The  solution  for  c(t )  and  M  ( t )  can  be  explicitly  written  as 

.  pt 

c(t)  =  c( 0)  •  efo  A(rOdTi  +  /  [/(T)  .  eU  A(n)dndr  ?  (7) 

Jo 

r>{ 

M(t )  =  eJo  Af(n)dn  .  +2f,  gf*  At(n)dn  .  e/*  Mn)dndr  .  (8) 

Jo 

Here  the  quantity  eB  is  a  matrix  exponential,  defined  by  the  usual  power  series 

eB  =  I  +  B +  \B  B  +  .... 
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It  should  be  noted  that  in  the  expression  (8)  for  M(t )  the  two  matrix  exponentials  cannot 
be  combined  and  written  as  efT(A^+A)(Tl')dT1 .  For  matrix  exponentials,  eBl+B2  =  eBl  ■  eB2 
only  if  the  matrices  B\  and  B2  commute.  The  velocity  gradient  matrix,  A,  is  typically  a 
non- normal  matrix  (AA^  7^  A^  A). 

For  a  simple  shear  flow,  we  have  u  =  (jy,  0)  and 


A  = 


0 

7 


Since  A  is  nilpotent  ( A 2 
representation,  so  that 


0),  the  matrix  exponential  has  only  two  terms  in  its  power  series 


eAt  =  I  +  At  = 


1  0 

jt  1 


3  Computation  of  Lyapunov  exponents 

Since  M  is  a  symmetric  matrix  the  eigendecomposition  takes  the  simple  form 

M  =  RDR]  (9) 

where  R  is  an  orthogonal  matrix  (the  columns  of  which  are  eigenvectors  of  A/),  and  D  is 
real  and  diagonal  (having  the  eigenvalues  of  M  on  the  diagonal).  When  the  eigenvalues  are 
sufficiently  distinct,  the  radial  and  angular  degrees  of  freedom  decouple  from  each  other,  as 
we  will  now  show.  (Our  derivation  parallels  that  of  [1].)  Let  us  substitute  the  expression 
(9)  in  the  evolution  equation  (8): 

RDR)  +  RDitf  +  RDR f  =  RDR*A  +  A^RDR?  +  2k/, 

R^RD  +  DR^R  +  D  =  DR^  AR  +  R*  A^  RD  +  2k/ .  (10) 

Since 

f  (R^R)  =  &R  +  R)R  =  0 
at 

both  R^  R  and  R  '  R  are  antisymmetric  matrices.  Thus  in  equation  (10)  the  first  two  terms 
on  the  left  do  not  have  any  diagonal  elements.  Denoting  A  =  R^  AR  as  the  rotated  velocity 
gradient  matrix  we  have  the  following  equation  for  the  diagonal  elements, 

B%i  =  2A uDa  +  2k  , 

where  repeated  indices  do  not  indicate  summation.  If  we  assume  Da  =  e2pi  (pi  >  0  denotes 
extension  whereas  pi  <  0  represents  compression)  then  one  has 

Pi  =  An  +  ne~2pi  .  (11) 

Without  loss  of  generality,  we  arrange  the  eigenvalues  e2pl ,  e2p 2 , . . . ,  e2pd  in  non-increasing 
order: 

Pi  >  P2  ■  ■  ■  >  Pd  (12) 

where  d  is  the  dimension  of  space. 
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Extension 
e.  2"  «1 


Compression 

e2p<»\ 


Figure  2:  The  diffusive  term  (ne  2pi)  in  the  equation  for  pi  would  be  negligible  along  the 
extensional  axis  compared  to  the  compressional  direction. 


Having  worked  with  the  diagonal  matrix,  D ,  let  us  focus  on  the  eigenvector  matrix,  R. 
Considering  the  off-diagonal  terms  of  equation  (10), 

[Rj  RD]ij  =  [R^  R]n>D£j  =  [R^  R]ijDjj 
[DR^R]ij  =  Dn[R^R]£j  =  ~[R)  R\ijDa 

where  repeated  indices  do  not  indicate  summation.  Let  us  define  RJ  R  =  Q,  an  antisym¬ 
metric  matrix.  Thus  we  have, 


( Rjj  —  Dii-Aij  T  AjiDjj 

e2piAij  +  e2pi  Aji 


H ij  — 


pj  —  g2  pi 


i  +  j 


(13) 


Let  us  assume  there  exists  scale  separation  in  p’s  (pi  P2  Pd)-1  Thus  if  pt  pj, 

the  t  1  solution  is 


n 


*? 


_ 4. . 
A  ■■ 

3%i 


i  <  j 
i  >  j 


(14) 


Thus  A  is  independent  of  the  eigenvalues  and  we  can  solve  equation  (11)  directly, 


Pi(t)  =  pi( 0)  +  Ai(t)  +  \  log 


where  Ai(t)  =  /  Ah(t)cIt  . 


When  diffusion  vanishes  (k  =  0),  this  reduces  to 


1  +  2ne  2pi(0)  /  e  2Aiif') dt ' 

Jo 


(15) 


Pi(t)  =  pi( 0)  +  /  Ah(t)(It  . 

Jo 


(16) 


1This  is  almost  always  true  for  long  times,  but  it  may  fail  locally  or  due  to  a  symmetry  in  the  problem. 
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The  k  =  0  case  can  be  used  to  compute  Lyapunov  exponents2  from  the  eigenvalues  of  M: 


A  i  =  lim 


log(ePi^) 

t 


lim 

£—>•00 


Pi(t) 

t 


(17) 


The  convergence  of  the  above  limit  is  assured  by  Oseledec’s  multiplicative  ergodic  theo¬ 
rem  [7]. 

A  blob  of  initial  size  L  will  become  an  ellipsoid  with  the  length  of  its  main  axis  changing 
as  exp(Aji)  along  directions  corresponding  to  positive  and  negative  Lyapunov  exponents. 
The  rate  of  decrease  of  the  smallest  dimension  is  determined  by  A^-  Because  of  diffu¬ 
sion,  the  contraction  terminates  after  a  time,  tK  «  | A^| — 1  log(772 1 A^ | / ac) ,  when  the  diffusive 
lengthscale,  rK  =  \J K/\Xd\  is  attained. 


4  Large-deviation  form  of  the  central  limit  theorem3 

To  form  a  probabilistic  theory  of  mixing,  we  make  A,  and  therefore  pi  and  p-2  (we  restrict 
to  d  =  2  dimensions  from  now),  into  random  variables  and  deduce  the  approximate  long¬ 
time  moments  of  concentration.  To  do  so,  we  require  a  central  limit  theorem  (CLT)  which 
is  also  valid  for  large  deviations. 

Let  Xi  be  i.i.d.  (independent  and  identically  distributed)  random  variables  with  finite 
mean  x  and  variance  a2.  Let  x  be  the  mean  of  n  such  x.L.  Roughly  speaking,  the  classical 
CLT  asserts  that 

\  /  n(x  —  x')2  A 

P(x,  n)  ~  .  exp - ^ -  when  n  3>  1  and  x  —  x  <  -7=. 

v  '  V^W2  V  2  a2  )  ^ 

The  theorem  applies  to  large  n,  but  the  condition  that  x—x  <  ^  means  that  as  n  gets  large, 
the  region  around  the  mean  to  which  the  theorem  applies  grows  increasingly  narrow.  If  one 
is  concerned  about  the  tails  of  the  distribution,  the  theorem  is  not  powerful  enough.  This 
is  not  surprising  since  it  contains  only  second-moment  information  about  the  Xj  variables, 
and  higher  moments  may  be  critical  at  the  tails.  In  the  theory  of  mixing,  the  tails  of  the 
p  PDF’s  are  indeed  significant,  so  we  must  use  the  large  deviation  form  of  the  CLT,  which 
is  stronger  and  uses  information  from  all  moments  of  the  Xi  variables.  The  significance  of 
the  tails  will  be  evident  a  posteriori  from  the  analysis  using  the  large  deviation  CLT.  The 
large  deviation  CLT  asserts  that 

P(x,  n)  ~  exp  (— nS(x  —  x)) , 

where  S  is  the  Cramer  function  (or  rate  function,  or  entropy  function)  determined  by  the 
PDF  p(xi).  (In  these  notes  we  typically  neglect  algebraic  prefactors  in  asymptotic  relations, 
such  as  the  one  above.)  Whatever  the  form  of  p(xi),  S  is  convex,  and  5(0)  and  5'(0)  vanish. 
Taylor  expanding  5  about  0  in  the  large  deviation  CLT,  the  first  nonzero  term  is  quadratic 

2Lyapunov  exponents  describe  the  rate  at  which  infinitesimally  close  trajectories  in  a  dynamical  system 
diverge  from  each  other.  A  chaotic  system  is  usually  indicated  by  a  positive  infinite-time  Lyapunov  exponent. 

3The  ‘facts’  invoked  in  this  section  come  from  theorems  which  for  the  most  part  give  bounds,  not  equali¬ 
ties.  We  are  leaving  out  many  details  that  are  necessary  to  make  the  theory  rigorous.  See  for  example  [3,8,11] 
for  more  details. 
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in  x  —  x.  Truncating  after  this  term  recovers  the  classical  form  of  the  CLT,  so  the  classical 
form  simply  approximates  the  large  deviation  form  to  the  first  nontrivial  order.  The  explicit 
expression  for  the  Cramer  function  in  terms  of  p(xi)  involves  forward  and  inverse  Fourier 
transforms,  which  generally  cannot  be  explicitly  evaluated.  Rather  than  prove  the  large- 
deviation  CLT  or  derive  the  general  expression  for  the  Cramer  function  (see  e.g.  [3,8,10,11]), 
we  shall  merely  derive  the  Cramer  function  explicitly  in  the  simple  case  where  the  Xi  have 
a  Bernoulli  distribution,  though  the  general  formula  for  S  may  be  derived  by  an  analogous 
procedure. 


5  Large  deviation  CLT  for  the  Bernoulli  distribution 

Let  Xi  have  the  PDF  of  a  fair  coin, 

p(xi )  =  \6(xi  +  1)  +  \5(xi  -  1). 

The  characteristic  function  of  a  random  variable  is  the  Fourier  transform  of  its  PDF,  gen¬ 
erally  written  in  the  form  e~s^k\  We  wish  to  derive  p(x,n),  the  PDF  of  the  mean  x  of  n 
random  variables  a We  are  given  p(xi),  not  p(x,  n),  but  we  can  express  the  latter  in  terms 
of  the  former.  Because  the  xt  are  independent,  the  joint  distribution  of  x±,  ...,xn  is  simply 
the  product  of  their  individual  PDF’s,  so 

/n 

p{x  1,  ...,xn)s(x  -  \  y,  Xi)  dx  1  •  •  •  dxn 

i= l 

/n 

p(x i)  •  •  •  p(xn)5 (x  -  l  -  x)dxi  ■  ■  ■  dxn. 

i= i 

It  is  hard  to  compute  the  PDF  directly  from  the  PDF  of  Xi  by  the  above  formula.  Rather, 
we  shall  compute  the  generating  function  of  x,  and  then  its  PDF.  This  is  easier  because 
integrating  out  x-dependence  allows  us  to  then  integrate  out  the  Xi  variables.  The  generating 
function  of  x  is 

e-5(A.)  _  f  p^x^n^e-lkx(lx 


=  /  P(x i)  •  •  -p(xn)8 (x  -  \ 


Xj  e 


—ikx 


dx i • •  •  dxndx 


i— 1 


=  J p{x l)  •  •  •  p(xn)  exp  (  -  ^  x]dxi  •  •  •  dxr 

n  „ 

=  Y[  p(a 


i= 1 


ikx ^ 

[Xi  dxj 


1=1  ' 


p(x i)e  ™  dx i 

(s  J  +  1)  +  ""  l))e__srL dxi'j 


=  cos 


n  I  k 


(knY 
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We  recover  the  PDF  of  x  by  taking  the  inverse  Fourier  transform  of  its  generating  function: 

P(x,n)  =  ±  j  cos n(±)eikxdk 

=  i/exp(„ln[cos©]+to)* 

=  ^J  exp  (n[ln(cos  K)  +  iKxfjdK, 

where  K  =  K  The  last  form  of  the  above  integral  suggests  the  stationary  phase  approx¬ 
imation,  since  the  frequency  of  the  oscillatory  part  of  the  integrand  goes  to  infinity  as  n 
does.  Omitting  the  details,  the  stationary  phase  approximation  yields 

S(x)  ~  ~l(x  +  1)  In  (^f)  +  ln(l  -  x). 

The  classical  form  the  the  CLT  would  give  only 

p(x,n )  ~  exp  (  —  yS"(0)(x  —  x)2) 

~  eXP  {  -  ~2JW~ )  ■ 

In  this  example,  the  classical  CLT  overestimates  the  weight  of  the  tails  of  p(x,n),  though 
typically  the  opposite  is  the  case.  In  any  case,  the  large  deviation  form  is  needed  for  accuracy 
at  the  tails. 

6  Large  deviation  CLT  applied  to  mixing 

Recall  that  p2  has  generally  become  quite  small  at  long  times.  However,  in  a  fluid  with 
diffusion,  the  p-2  dimension  eventually  becomes  small  enough  that  further  contraction  by 
stretching  is  balanced  by  diffusion.  This  is  the  Batchelor  width,  \J ac/Ai,  beyond  which  the 
scalar  blob  will  contract  no  further.  As  such  late  times  we  can  assume  that  the  PDF  of  p2 
is  stationary  in  time,  so  as  t  — >  oo, 

P(pi,P2,t)  ~  exp  (  -  ts(^  -  Ai))pstat(P2)- 

The  exact  stationary  distribution  of  p2  is  irrelevant  to  the  heuristics  of  this  theory.  Suppose 
that  the  concentration  at  each  point  in  a  blob  of  scalar  is  inversely  proportional  to  the 
blob’s  total  volume,  so 

0(x,t)  ~  (det  M)1/2  =  e"(pl+p2). 

This  is  an  accurate  assumption  for  a  Gaussian  blob,  for  instance.  We  now  combine  the 
above  approximations  to  estimate  the  expectation  of  9a . 

(0a)(t)  ~  C(t)  j  e-“(P!+P2)  exp  ^  _  tS(^  -  Ai) Pstat(p2)dpidp2 

~  J  e~api  exp  —  —  X\)^jdpi, 
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Figure  3:  Generic  S(x)  and  S(h  —  Ai).  When  a  >  ac,  there  is  no  positive  h*  such  that 
S'(h*  —  A)  =  — a ,  so  we  must  choose  h*  =  0  for  the  saddle-point  approximation. 

where  C (t)  is  a  normalization  that  is  algebraic  in  t  and  is  therefore  asymptotically  dominated 
by  the  integral,  so  we  have  ignored  it  along  with  whatever  constant  emerges  from  integrating 
over  p2-  Letting  h  =  ^L, 

(6°){t)  ~  J  e~t(<ah+S{h-Xl))  dh  =  J  e~tHWdh , 

where  H(h)  =  ah  +  S(h  —  A).  This  integral  may  be  approximated  by  the  saddle-point 
method  for  large  t. 

The  saddle-point  method  typically  asserts  that 

(9a){t)  ~  e-W*, 

where  h*  is  such  that  H'(h*)  =  0,  i.e.  such  that  S'(h*  —  Ai)  =  —a.  In  the  present  case,  it  is 
not  always  possible  to  choose  such  an  h*  because  h  must  be  nonnegative.  This  is  clear  from 
the  generic  S  and  S'(h  —  A)  curves  plotted  in  Figure  6.  Whatever  the  exact  form  of  the 
Cramer  function,  S' (h)  begins  at  the  origin  and  increases  monotonically  in  h,  so  S' (h  —  A) 
begins  at  some  negative  point  (0,  —  ac)  and  increases  monotonically  in  h.  When  a  <  ac, 
H'(h*)  =  0  is  possible,  but  when  a  >  ac,  the  best  we  can  do  is  to  choose  h*  =  0.  That  is, 

(9a){t)  ~  e-t{ah*+s'(h*~x^) , 

where  h*  =  max{0,  A  +  (<S")_1(— a)}.  Neglecting  large  deviations,  the  classical  CLT  would 
have  given  a  decay  rate  of  — aA,  which  is  an  upper  bound  on  the  decay  rate  we  derived.  In 
fact,  the  tails  of  the  PDF  of  pi  act  significantly  to  slow  the  decay  of  concentration.  Above 
ac ,  ( 9a)(t )  is  constant  in  a.  We  may  understand  this  plateauing  as  high  concentration 
blobs  dominating  the  expectation  for  large  a.  These  are  the  blobs  that  have  experienced  no 
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stretching,  so  a  >  ac  moments  are  determined  solely  by  realizations  of  6  without  stretching. 
Note  that  the  plateauing  of  moments  also  means  that  the  La  norm  of  6  vanishes  as  a  — >  oo, 
which  is  characteristic  of  intermittency. 


References 

[1]  E.  Balkovsky  and  A.  Fouxon,  Universal  long-time  properties  of  Lagrangian  statis¬ 
tics  in  the  Batchelor  regime  and  their  application  to  the  passive  scalar  problem,  Phys. 
Rev.  E,  60  (1999),  pp.  4164-4174. 

[2]  G.  K.  Batchelor,  Small-scale  variation  of  connected  quantities  like  temperature  in 
turbulent  fluid:  Part  1.  General  discussion  and  the  case  of  small  conductivity,  J.  Fluid 
Mech.,  5  (1959),  pp.  113-133. 

[3]  R.  S.  Ellis,  Entropy,  Large  Deviations,  and  Statistical  Mechanics,  Springer- Verlag, 
New  York,  1985. 

[4]  G.  Falkovich,  K.  Gawedzki,  and  M.  Vergassola,  Particles  and  fields  in  turbu¬ 
lence,  Rev.  Mod.  Phys.,  73  (2001),  pp.  913-975. 

[5]  P.  H.  Haynes  and  J.  Vanneste,  What  controls  the  decay  of  passive  scalars  in  smooth 
flows?,  Phys.  Fluids,  17  (2005),  p.  097103. 

[6]  R.  H.  Kraichnan,  Small-scale  structure  of  a  scalar  field  convected  by  turbulence, 
Phys.  Fluids,  11  (1968),  pp.  945-953. 

[7]  V.  I.  Oseledec,  A  multiplicative  theorem:  Lyapunov  characteristic  numbers  for  dy¬ 
namical  systems,  Trans.  Moscow  Math.  Soc.,  19  (1968),  pp.  197-231. 

[8]  A.  Schwartz  and  A.  Weiss,  Large  Deviations  for  Performance  Analysis,  Chapman 
&  Hall,  London,  1995. 

[9]  J.  Sukhatme  and  R.  T.  Pierrehumbert,  Decay  of  passive  scalars  under  the  action 
of  single  scale  smooth  velocity  fields  in  bounded  two-dimensional  domains:  From  non- 
self-similar  probability  distribution  functions  to  self-similar  eigenmodes,  Phys.  Rev.  E, 
66  (2002),  p.  056032. 

[10]  J.-L.  Thiffeault,  Scalar  decay  in  chaotic  mixing,  in  Transport  and  Mixing  in  Geo¬ 
physical  Flows,  J.  B.  Weiss  and  A.  Provenzale,  eds.,  vol.  744  of  Lecture  Notes  in 
Physics,  Berlin,  2008,  Springer,  pp.  3-35. 

[11]  S.  R.  S.  Varadhan,  Large  deviations  and  applications ,  Society  for  Industrial  and 
Applied  Mathematics,  Philadelphia,  PA,  1984. 

[12]  W.  R.  Young,  Stirring  and  mixing,  in  Proceedings  of  the  1999  Sum¬ 
mer  Program  in  Geophysical  Fluid  Dynamics,  J.-L.  Thiffeault  and  C.  Pas- 
quero,  eds.,  Woods  Hole,  MA,  1999,  Woods  Hole  Oceanographic  Institution, 
http : //gf d. whoi . edu/proceedings/1999/PDFvoll999 .html. 


43 


[13]  Y.  B.  Zeldovich,  The  asymptotic  law  of  heat  transfer  at  small  velocities  in  the  finite 
domain  problem ,  Zhurnal  eksperimentalnoi'  i  teoreticheskoi’  fiziki,  7  (1937),  pp.  1466- 


1468. 


[14]  - ,  Exact  solution  to  the  problem  of  diffusion  in  a  periodic  velocity  field  and  turbulent 

diffusion ,  Doklady  Akademiia  Nauk  SSSR,  266  (1982),  pp.  821-826. 

[15]  Y.  B.  Zeldovich,  A.  A.  Ruzmaikin,  S.  A.  Molchanov,  and  D.  D.  Sokoloff, 
Kinematic  dynamo  problem  in  a  linear  velocity  field,  J.  Fluid  Mech.,  144  (1984),  pp.  1- 

11. 


44 


Social  Behavior,  Mixing,  and  the  Evolution  of  Schooling 

Glenn  Flier  1 

Notes  by  Kiori  Obuse  and  Emma  Boland  (nee  Thompson) 

May  11,  2011 


1  Introduction 

Swimming  organisms  adjust  their  behavior  in  response  to  environmental  conditions  and  form 
structures  such  as  patches,  swarms,  and  schools.  Disadvantages  of  these  tendencies  include 
tougher  competition  for  food,  the  attraction  of  predators;  some  hydrodynamic  disadvantages 
-  turbulent  wakes  disrupt  movement,  and  more  energy  is  required  to  swim,  etc.  There  are 
several  advantages  however,  such  as  enhanced  reproduction,  predator  avoidance,  and  the 
easier  search  for  prey;  some  hydrodynamic  advantages  -  extra  turbulence  brings  higher 
encounter  rates,  and  coordinated  swimming.  Here,  we  study  a  model  for  such  organisms’ 
behavior. 

2  Lagrangian  Dynamics 

In  order  to  consider  the  model  mentioned  above,  we  start  with  the  Lagrangian  dynamics  of 
an  individual  or  molecule  of  fluid.  The  stochastic  differential  equations  for  the  position  X 
and  velocity  U  are 

dXi  =  Uidt ,  (1) 

dUi  =  Aidt  +  fiijdWj,  (2) 

where  A  is  the  acceleration  produced  by  deterministic,  or  large-scale,  forces,  fiijdWj  is  the 
random  acceleration,  with  the  random  increment  dWj  satisfying  ( dWi )  =  0  and  (dWt  dWj)  = 
Sijdt,  and  (3ij  will  be  a  diagonal  matrix.  The  equations  (1)  and  (2)  also  hold  for  continuous 
X  and  discrete  U.  As  an  example,  consider  a  drag  law  for  the  acceleration 

A  =  — r(U  —  u),  (3) 

with  u  being  the  water  velocity  and  =  f35ij.  Then  the  dispersion  is  determined  by  /3  and 
r;  from  the  equations,  by  assuming  that  the  statistics  become  time-independent  for  large  t, 
( Ui(t ))  =  (Ui(t  +  dt))  in  the  long-time  limit  or  solving  the  equations  (1)  and  (2)  by  utilizing 
Ito’s  lemma  (see,  for  example,  [2]),  we  can  show  that 

(Ui)  ->  Ui,  (4) 

((Ui  -  Ui)(Uj  -  uj ))  ->•  ^<%,  (5) 

(Xi(t)Xj(t))  -»•  (Xj(O)Xj(O)}  +  (6) 
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t  =  0.  000 


r  =  1.  B  -  0.  5,  u  =  0.  5.  t  =  30.  000 


(a) 


(b) 


r=  1.  jS  =  0.  25.  u  =  0.  5.  t  =  30.  000  r  =  2,  jB  =  0. 5.  u  =  0.  5.  t  =  30.  000 


X 


X 


Figure  1:  Distributions  produced  by  numerical  simulation  of  Lagrangian  particles  from 
equations  (1,2),  showing  (a)  the  initial  conditions  for  all  cases,  (b)  r  =  l,/3  =  0.5, u  = 
0.5,  t  =  30,  (c)  r  =  l,/3  =  0.25,  u  =  0.5,  t  =  30,  (d)  r  =  2,/3  =  0.5,  u  =  0.5,  t  =  30. 


Figure  2:  Histogram  of  velocities  from  numerical  simulations  of  Lagrangian  particles  with 
the  parameters  (a)  r  =  l,/3  =  0.5,  u  =  0.5,  t  =  30  (b)  r  =  l,/3  =  0.5,  u  =  0.25,  t  =  300  (c) 
r  =  2,/3  =  0.5,  u  =  0.5,  t  =  30.  The  blue  and  the  green  lines  in  each  panel  represent  the 
velocities  in  the  x-  and  y-directions,  respectively. 
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As  diffusion  coefficient  n  is  k  =  ( dXidXj )  /(2 dt),  the  latter  corresponding  to  a  diffusivity 
of  k  =  /32/2r2,  which  means  that  the  area  grows  like  4 nt  and  the  velocity  variance  is  rn. 

Figure  1  contains  snapshots  of  the  distribution  of  particles  for  different  combinations  of 
the  parameters  r  and  /3.  We  can  see  that  the  area  occupied  by  the  particles  grows  more 
for  larger  /3  (see  Fig. 1(b)  and  (c))  and  smaller  r(see  Fig.l  (b)  and  (d)).  This  can  also 
be  confirmed  by  the  probability  density  functions  (not  normalized  to  one)  of  the  x-  and 
y-velocities  shown  in  Fig. 2. 

3  Grouping  Mechanisms 

To  consider  a  variety  of  motions,  we  use  the  form 


dXi  =  Uidt, 


dUi  =  —r{Ui  -  m  -  Vi)dt  +  (dijdWj.  (7) 

Here,  V  denotes  the  preferred  swimming  velocity,  swimming  towards  the  surface  for  exam¬ 
ple.  Now  we  distinguish  between  various  behaviors  of  swimming  organisms.  The  properties 
described  below  maybe  used  individually  or  together,  depending  upon  the  organisms  con¬ 
sidered. 

•  taxis:  V  depends  on  gradient  of  cue  field  VC(x,f)  (Fig. 3),  where  cue  C(x,  t)  may  be 
environmental  (food,  light,  depth,  etc.)  or  social  (positions  of  neighbors,  etc.).  This 
describes  a  large-scale  preferred  velocity  that  the  group  tends  to. 

VC(x,t) 


Figure  3:  Representation  of  taxis. 

Here,  we  define  taxis  as  a  preference  for  moving  up  the  gradient  of  the  cue  held, 

V  =  aVC(x). 


As  an  example,  we  consider 


C  =  Co  [1  —  cos(fcx)]  /2  (8) 

so  that  V  =  Vo  sin(fcr)x  and  A  =  —  r  [U  —  u  —  Vo  sin(fcx)x].  Figures  4  and  5  show  the 
distribution  and  the  PDF  (not  normalized  to  one)  of  the  organisms  taken  from  numerical 
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simulations,  respectively.  Taxis  on  a  spatially  and  temporally  fixed  cue  field  has  its  center 
of  aggregation  where  V  •  Vprej  (or  V2C)  is  most  negative  when  there  is  no  advection  (see 
Fig. 4  (b)),  and  the  advection  can  shift  this  downstream  (see  Fig. 4  (b)-(d)),  which  decreases 
the  strength  of  the  aggregation  as  a  result  (see  Fig.4  (b),  (c),  and  (d)).  We  can  also  see 
that  the  diffusion  coefficient  k  =  f32/2r 2  controls  the  width  of  the  aggregation  (Figs. 4,  5). 


X 


X 


Figure  4:  Taxis:  Numerical  simulation  results  of  the  example  described  in  Section  3  showing 
organism  distribution  (blue  dots)  and  the  profile  of  C  (green  line)  with  (a)  the  initial 
conditions  for  all  cases,  (b)  u  =  0,/3  =  0.7,  t  =  30,  (c)  u  =  0.5,  f3  =  0.7,  t  =  30,  (d) 
u  =  0.5,  &  =  0.35,  t  =  30,  (e)  u  =  1.1,  p  =  0.7,  t  =  30. 


We  then  define  social  taxis  as  grouping  when  the  cue  field  is  defined  in  terms  of  the 
positions  of  other  organisms; 


C(X)  =  £ 


X'  L 


i|x'  -  X|4  -  i|X'  -  X|2  ,  |X'-X|<1, 


(9) 


so  that  the  VC  can  be  expressed  as  VC  =  (X  —  X')u;(|(X  —  X')|)  with  weight  function  w 
having  the  profile  shown  in  Fig. 6.  Using  this  form  of  social  taxis,  the  temporal  variation  of 
the  aggregation  of  the  considered  organisms  depends  upon  the  displacement  of  each  set  of 
two  organisms,  i.e.  |X'  —  X|,  and  so  groups  close  to  each  other  merge  as  time  progresses  as 
shown  in  Fig. 7  (note  the  groups  in  red  circles). 


•  kinesis:  (5  depends  on  the  cue  field: 

P  =  /3(C).  (10) 

This  describes  an  individual’s  tendency  to  move  randomly,  when  they,  for  example, 
find  some  food  close  to  them  (Fig. 8),  and  is  a  more  primitive  response  in  which  the 
random  accelerations  increase  or  decrease  depending  on  the  cue  field. 
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Figure  5:  Taxis:  Histograms  from  numerical  simulations  of  organism  distribution  in  the 
x-direction  for  (a)  u  =  0,  /3  =  0.7,  t  =  30,  (b)  u  =  0.5 , /3  =  0.7,  t  =  30,  (c)  u  =  0.5, /?  = 
0.35,  t  =  30,  (d)  u  =  1.1  ,/3  =  0.7,  t  =  30. 


Figure  6:  An  example  of  the  weight  function  icdX'  —  X|)  used  in  social  taxis.  Here, 
w(x)  =  (|x|3  —  \x\)/x. 
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Figure  7:  Social  Taxis:  The  temporal  variation  of  organism  distributions  from  numerical 
simulations.  From  left  to  right,  the  panels  show  the  organism  distribution  at  times  t  = 
0,  t  =  27,  and  t  =  28. 


Figure  8:  Representation  of  kinesis. 

Let  f3  =  (3 o  —  (3\C/Co,  for  example,  and  consider  C  given  by  (8)  again.  Figures  9  and  10 
show  example  distributions  of  organisms  from  numerical  simulations.  Kinesis  can  produce 
aggregation  since  (3  decreases  as  C  increases,  which  means  dU/dt  decreases  as  C  increases. 
The  groups  tend  to  be  looser  however,  depending  upon  (3max/ f3min,  and  u  as  mentioned 
before. 

As  another  example  of  kinesis,  we  assume  that  the  organisms  turn  more  frequently  and 
have  smaller  mean  free  path  in  the  presence  of  many  neighbors.  Now  we  consider  (3  in  the 
form  (3  oc  exp  (—C/Co),  and  write  social  kinesis  as 

C(X)  =  [i  -  |X  —  X'|2]  ,  |X  -  X'|  <  1. 

X' 

This  does  not  include  the  fourth  order  term  as  in  (9),  since  social  kinesis  appears  in  (3 
as  a  non-derivative  form.  The  possible  situations  that  can  be  produced  are  some  random 
turning,  more  random  turning,  and  the  avoidance  of  others  as  shown  in  Fig. 11. 
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(b)  u  =  0,  &  =  2. 7.  &  -  2.  t  =  30.  000 


(c)  u  =  0,  /So  =  2  7.  /9,  =  2.  6.  t  =  30.  000  (d) 


u  =  0.  5.  ^o  =  2.  7.  /Si  =  2. 4 .  t  =  30.  000 


0  2  4  6  8  10 


Figure  9:  Kinesis:  Distributions  of  organisms  from  numerical  simulations  with  (a)  initial 
time  for  all  cases,  (b)  u  =  0,  [3q  =  2.7,  (3\  =  2.0,  t  =  30,  (c)  u  =  0,  Pq  =  2.7,  =  2.6,  t  =  30, 

(d)  u  =  0.5, 0 o  =  2.7,  pi  =  2.4,  t  =  30. 


(a)  11  =  0.  Ai=2.7,  Si  =  2.  t  =  30.000  (b)  u  =  0.  2.  7,  ft  =  Z  6.  t  =  30. 000  fc)  u  =  0.  5.  0o=  2.  7,  ,8-  =  2.  4.  t  =  30.  000 


XXX 


Figure  10:  Kinesis:  Histograms  of  organism  distributions  in  the  x-direction  from  numerical 
simulations  for  (a)  u  =  0,/?o  =  2.7,  /5i  =  2.0,  t  =  30,  (b)  u  =  0,/3o  =  2.7,  /5i  =  2.6,  t  =  30, 
(c)  u  =  0.5,  Pq  =  2.7,  Pi  =  2.4,  t  =  30. 
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Figure  11:  Social  kinesis,  (a)-(d)  Distributions  of  organisms  from  numerical  simulations 
showing  (a)  the  initial  distribution  for  all  cases,  distributions  after  the  time  shown  in  the 
cases  (b)  with  some  random  turning,  (c)  more  frequent  random  turning,  and  (d)  organisms 
avoiding  neighbors;  and  (e)  the  temporal  variation  of  spatial  average  of  C  for  panels  (c) 
(blue)  and  (d)  (black). 
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In  the  case  that  organisms  turn  randomly  occasionally,  the  system  ends  up  with  a 
large  group  of  organisms  (Fig.  11(b)),  whilst  the  other  two  cases  end  up  with  much  looser 
groups  (Fig. 11(c)  and  (d)).  Comparing  the  cases  of  turning  randomly  and  avoidance,  the 
avoidance  case  shows  stronger  grouping,  since  the  small  mean  free  path  interferes  with 
the  avoiding  behaviour.  This  can  also  be  verified  by  the  temporal  variation  of  the  spatial 
average  of  the  social  kinesis  C,  which  measures  the  strength  of  the  grouping;  the  avoidance 
case  shows  a  much  larger  value  of  average  C  than  that  of  the  case  with  frequent  random 
turning(Fig.  1 1  (e) ) . 

•  Schooling:  V  depends  on  neighbor’s  U  with  |V|  having  a  fixed  value  (Fig. 12):  V  = 
V  (Unejghbors )  •  This  describes  the  behavior  of  the  organisms  that  tend  to  swim  simi¬ 
larly  to  their  neighbors. 


P^  -Pp  p' 

P„  ■%>  -P-Pj&P 
p  P  P  -P  -P 

si  si  ~sr> — >  _ 


Figure  12:  Representation  of  schooling. 

The  preferred  direction  of  the  swimming  organisms  results  from  a  combination  of  at¬ 
traction  and  alignment  tendencies,  and  so  schooling  can  be  represented  as 

V  =  R0Vi/|V1|, 

V!  =  aJ^(X'  -  X)w(|X'  -  X|)  +  ^  U'wflX'  -  X|). 

X'  X' 

The  first  term  represents  attraction  to  neighbors,  a  controls  the  strength  of  the  attraction; 
larger  a  gives  stronger  schooling  as  seen  in  Figure  13  which  shows  the  temporal  variation  of 
organism  distributions  from  numerical  simulations  for  different  a.  Paying  attention  to  the 
groups  surrounded  by  circles,  we  notice  that  the  case  a  =  0.3  shows  weak  schooling  and  the 
groups  tend  to  break  apart  easily,  whilst  the  case  a  =  0.7  shows  much  stronger  schooling. 
The  second  term  represents  the  tendency  of  organisms  to  align  their  swimming  with  their 
neighbors.  The  distance  between  two  organisms  at  which  alignment  becomes  effective  tends 
to  be  smaller  than  that  for  attraction,  as  shown  in  Fig.  14.  Here,  the  choice  of  the  weighting 
function  w  affects  the  internal  structure  of  self-organized  groups. 
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Figure  13:  Schooling:  Snapshots  of  organism  distributions  from  numerical  simulations,  (a) 
the  initial  condition  for  all  cases,  (b-1)  a  =  0.7,  t  =  580,  (b-2)  a  =  0.7,  t  =  585,  (b-3) 
a  =  0.7,  t  =  590,  (c-1)  a  =  0.3,  t  =  615,  (c-2)  a  =  0.3,  t  =  620,  (c-3)  a  =  0.3,  t  =  625,  (c-4) 
a  =  0.3,  t  =  630. 


Figure  14:  Schooling:  Areas  of  attraction  (outer  circle)  and  alignment  (inner  circle).  The 
gray  dots  represent  organisms. 
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The  Boltzmann  Equation 


We  are  interested  in  finding  the  form  of  the  probability  density  function  of  the  system, 
P(Xi,  Ui,  X2, U2, . jt).  The  evolution  of  this  quantity  is  described  by 

=  -  VXi  •  (U iV)  -  VUi  •  (A (P)  +  I  dnU'  T(U  -  U'\x,  U',  t )  V.  (11) 

If  V  is  sharply  peaked  in  velocity,  the  last  term  on  the  RHS,  which  can  be  thought  of 
as  a  ‘collision’  rate,  can  alternatively  be  written  as 

f  fiimfijm 

SUiUj  V  2  J  ■ 

If  we  assume  independence  of  organisms  i,  which  is  not  true  for  social  behavior,  the 
probabilities  are  equal  for  all  i,  and  we  can  rewrite  the  evolution  as 

^  =  -Vx  •  (UP)  -  Vu  •  (A V)  +  \v2J2V-  (12) 

The  first  term  on  the  RHS  represents  advection  in  the  x-direction,  proportional  to 
the  velocity  at  that  point,  the  second  term  represents  advection  towards  the  x-axis  (if 
A  ~  — rU),  and  the  third  term  represents  diffusion.  The  first  two  terms  are  depicted  in 
Figure  15.  As  can  be  seen,  organisms  will  be  directed  towards  the  x-axis  and  parallel  to 
it,  resulting  formation  of  a  thin  filament,  which  is  restrained  from  reaching  zero  width  by 
diffusion.  In  taxis ,  A,  the  advection  towards  the  x-axis,  depends  on  position  X,  and  in 
kinesis ,  /3,  the  diffusive  term,  depends  on  position. 


Figure  15:  Representation  of  the  directions  of  the  first  (dashed)  and  second  (dotted)  terms 
on  the  RHS  of  equation  (12).  In  effect,  organisms  filament  along  the  x-axis,  although  the 
diffusive  term  stops  the  width  from  going  to  zero. 
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Examples 


Example  solutions  to  equation  (12),  if  we  define  as  before  /5  =  /3o  —  P\C/Cq  and  A  = 
— r(U  -  u),  can  be  seen  in  Figure  16.  Figure  16b)  shows  that,  with  diffusion  constant  in 
space,  i.e.  [3\  =  0,  the  organisms  group  in  velocity,  but  not  in  space.  Adding  a  background 
flow,  u,  or  diffusion  dependent  on  the  cue  field  C(x),  causes  grouping  in  both  |U|  and  |X|, 
see  Figures  16d)  and  g). 


Figure  16:  Plots  showing  the  results  of  solving  the  Boltzmann  equation  (12).  In  a)  c)  and  f), 
the  velocity  of  the  organisms,  U,  is  plotted  vs.  |U|  and  |X|,  in  b)  d)  and  g),  the  probability 
density  function  V  at  large  t  is  plotted  vs.  |U|  and  |X|  and  in  e)  and  h),  the  density  of 
organisms  p  at  large  t  is  plotted  vs.  |X|.  Plots  a)  and  b)  have  /3q  =  2,  =  0  and  u  =  0. 

Plots  c)  d)  and  e)  have  (3q  =  1,  /3\  =  0  and  u  =  2.  Plots  f)  g)  and  h)  have  (3q  =  2.7,  j3\  =  2.4 
and  u  =  0.  All  quantities  have  been  non-dimensionalised. 


Density 

Let  us  define  the  density,  p  =  f  dXJV,  and  thus  the  evolution  is  defined 

^  vx  j dU  XJV  =  -  VXU p,  (13) 
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where  the  bar  indicates  a  space-mean. 


Velocity 

This  leads  to  the  following  expression  for  the  evolution  of  the  quantity  pU, : 

=  Vx.ipThTT“  •  T*W,)-  (14) 

We  can  also  write  —  Vu (AV)  =  —  Vu/(— ?’U  —  r\J')V,  and  then  replace  these  terms  back 
into  equation  (12).  It  can  be  seen  that  the  system  is  not  closed,  but  if  we  assume  that  the 
three  terms  in  (12)  roughly  balance,  i.e. 

U7D“«U,:U j  +  T^dij, 

then  the  system  is  closed,  and  V  is  now  quasi-Gaussian.  We  can  thus  derive  the  closed  mass 
equation 

^  +  V-(Up)  =  0,  (15) 

and  the  momentum  equation 

_  _  IB2  _ 

+  UjVjUj  +  -V^p  =  — r(Uj  -  u*  -  Vi).  (16) 

Drag  Dominated  Case 

If  drag  dominates,  then  we  can  set  the  velocity  U,  and  so  the  momentum  equation  becomes 

dp  1  /32 

^  +  V-(up  +  Vp)  =  V.-vL-p,  (17) 

Taxis  vs.  Kinesis 

Let  us  re-write  equation  (17)  in  terms  of  diffusivities: 

^  +  V-(u  +  V  - -V(r/e)p)  =  V  • /eVp,  (18) 

ot  r 

where  the  diffusivity  k  =  (32 /2r2  and  is  spatially  varying.  The  third  term  on  the  LHS 
represents  taxis ,  and  the  fourth  kinesis.  In  this  form  it  can  be  seen  that  both  act  similarly 
on  the  system,  converging  velocities  to  certain  regions,  either  those  with  smaller  velocities 
(taxis)  or  low  diffusivities  (kinesis).  This  can  also  be  seen  in  Figure  17,  which  shows  the 
initial  and  final  states  of  organisms  experiencing  either  (taxis)  or  (kinesis),  as  well  as  the 
final  density  states  in  both  cases.  However,  kinesis  has  similar  diffusivities  and  velocities, 
i.e.  Peclet  number  is  of  order  1,  while  the  velocity  in  taxis  can  be  higher. 
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Figure  17:  Numerical  simulations  of  the  system,  assuming  either  taxis  in  a)-c),  i.e.  V  = 
aVC  is  spatially  varying  but  j3  is  not,  or  kinesis  in  d)-f),  i.e.  /3  is  spatially  varying  but  V  is 
not.  The  green  line  in  Panels  a)  and  b)  shows  the  distribution  of  the  concentration  field  C. 
Panels  a),  d)  and  b),  e)  show  the  initial  and  final  distributions  of  organisms.  Panels  c)  and 
f)  show  the  solutions  of  the  Boltzmann  equation  (12),  [blue  line],  the  density  equation  (13) 
[green  line]  and  a  histogram  of  the  density  in  the  final  snapshot  of  the  simulation.  Note  that 
the  blue  and  green  lines  lie  on  top  of  each  other  in  Panel  f).  These  show  that,  while  the 
biological  mechanisms  behind  them  is  different,  both  processes  produce  similar  final  density 
distributions. 
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Stability  with  social  behavior 

In  the  absence  of  a  background  flow,  i.e.  u  =  0,  we  redefine  V  =  V  +  yV  •  (rn),  then 

dp 

Yt  +  V  •  (Vp)  =  V  •  KVp.  (19) 

The  social  behavior  is  specified  by  making  k  a  functional  of  the  density,  defined  in  terms  of 
a  weighting  function  w,  such  that 

k  =  K(  j  dsw(s)p(X.  +  s)), 

and  w  takes  the  form  of  decreasing  with  s,  which  represents  the  sensing  distance  of  the 
organism,  see  Figure  6.  If  the  basic  state  density  is  uniform,  then  k  is  constant  in  space, 
and  we  can  rewrite  p  and  n  as  a  sum  of  the  means  (over-bars)  and  small  perturbations 
(primes),  and  so  the  density  equation  becomes 


f)  /V 

-di  +  pV-  V'  =  V-TcVp',  (20) 

where 

V'  =  -Vk'  =  —K'(p)V  J ds  w(s)p'(X.  +  s).  (21) 

Taking  a  Fourier  transform  of  this  equation,  we  can  find  an  expression  for  the  growth 
in  density  perturbations: 

^p'  =  [pK'(p)k2w(k)  -  Pk2]p.  (22) 

If  the  bracketed  term  is  greater  than  zero,  the  system  experiences  positive  growth  see 
Figure  18,  which  shows  the  relationship  between  growth  rate  and  wave  number  k.  In  fact, 
the  system  experiences  explosive  (faster  than  exponential)  growth,  with  the  amplitude  going 
as 

BA 

=  cr(k)A  +  J\f A3 , 

where  N  >  0. 


Evolution  of  Social  Behavior 


I:  Inheritance  of  Traits 


As  before,  we  define  the  velocity  V  as  the  gradient  of  a  scalar,  i.e.  V  =  V</>,  see  [7],  where 


(j)  =  wo  Jdsw(s)Z(X.  +  s),  Jdsw(s)  =  l. 


(23) 


In  this  case,  the  weighting  function  w( s)  takes  the  form  depicted  in  Figure  19  with  repulsion 
at  short  distances. 

We  define  three  different  genetic  types,  or  alleles,  of  the  heterotroph  Z  (which  preys  on 
P ),  each  with  two  genes  with  two  possible  genetic  traits,  0  or  1:  Zoo,  -^oi,  -^li-  -^oo  and  Z\  \ 
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Figure  18:  Figure  from  [7],  showing  “the  linear  growth  rate  of  small  perturbations  around 
the  equilibrium  biomass  versus  the  wave  number  of  the  perturbations,  . . .  for  different  val¬ 
ues  of  t”,  oc  l/«,  a  diffusion  time  scale.  “In  a  uniform  distribution  of  particles,  random 
variations  will  spontaneously  lead  to  the  formation  of  patches,  if  there  is  positive  growth 
somewhere  along  the  wave  number  spectrum.  The  gray  line  shows  the  case  where  particle 
motion  is  decoupled  from  biological  activities.  . . .  When  the  diffusion  rate  is  fast  compared 
to  growth  (r  <<  1),  population  dynamics  have  a  negligible  impact  on  the  linear  growth 
rate  of  perturbations  in  the  region  of  instability  (positive  growth).  As  biological  processes 
become  faster,  their  effect  ...  [is  to]  stabilize  the  equilibrium  solution;  the  growth  rate  is 
negative  for  all  wave  numbers  when  r  ~  1” . 


Figure  19:  Figure  from  [7],  showing  the  weighting  function,  w(x),  of  organisms  as  function 
of  their  distance  from  neighbors  located  at  x  =  0,  as  used  in  equation  (23).  uw( x)  = 
exp(—x2/2)—exp(—2x2).  Distances  are  expressed  in  units  of  L,  the  characteristic  perception 
length  scale.  The  sensing  radius  has  an  approximate  value  of  2 L.  . . .  Positive  weighting 
indicates  attraction  to  neighbors.  Repulsion  at  short  distance  prevents  over-crowding.” 
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are  recessive,  and  only  Z\\  exhibits  grouping  behavior,  i.e.  Voo  =  Voi  =  0,  Vu  /  0  .  The 
population  of  each  type  evolves  as  follows: 


9  _  P  fgfi 

dfZap  a9p  +  phZ  +  c 


dZap  —  V  •  (Vap  —  PS7)Zap  +  Map, 


(24) 


where  0  <  a  <  1  is  the  conversion  efficiency  of  P  into  biomass  by  Z,  0  <  g  <  1  is  a  grazing 
rate,  d  is  the  death  rate,  P  is  the  population  of  prey  and  fap  is  the  inheritance  fraction  of 
the  population  Zap.  P \  is  the  half-saturation  density,  which  provides  a  limitation  to  the 
amount  of  prey  that  can  be  eaten  at  large  P,  i.e.  Z  depends  linearly  on  P  until  the  capacity 
of  Z  to  eat  P  becomes  reached.  The  c  term  represent  the  “Allee”  effect  [1]:  as  the  density  Z 
is  small,  encounters  between  male  and  female  organisms  are  rare  and  depend  quadratically 
on  density,  whereas  for  large  Z  virtually  all  females  have  mated  and  reproduction  depends 
linearly  on  the  population  gets  large,  virtually  all  females  have  mated.  From  a  Mendelian 
table  of  inheritance [4]  we  can  define  the  fractions  fap  as: 


/oo 

/oi 

/ii 

Z2 


Zoo  +  ZqqZqi  +  -Zq  1, 


ZoqZqi  +  2ZoqZu  +  -Z01  +  ^oi-^iii 

Z\i  +  ZnZoi  +  -^oi> 

/oo  +  foi  +  /ll  =  (^oo  +  ^oi  +  Zu)2, 


where  this  implies  that,  for  example,  a  new  generation  of  Zoo  is  produced  by  all  interactions 
between  two  Zoo,  half  of  the  interactions  between  Zoo  and  Zoi,  and  1  in  4  of  interactions 
between  two  Zoi- 

We  also  include  a  mutation  term,  Map,  which  allows  for  the  possibility  of  a  population 
reforming  if  it  dies  out  completely: 

Moo  =  — /rZ0o  + -/xZ0i, 

Mo  o  =  —  fiZoi  +  //Zoo  +  yuZn, 

M\\  =  —  I^Zn  +  -//Zoi, 

where  ^  is  the  fractional  rate  of  mutation.  The  prey,  P  evolve  as  follows; 


dP 

~dt 


bP 


p  z2 

9P  +  PhZ  +  c 


+  VkVP, 


(25) 


where  b  is  the  growth  rate,  Pc  is  the  half-saturation  density,  and  all  other  quantities  are  as 
before. 


Examples 

There  are  specific  conditions  under  which  the  grouping  type,  11,  can  win  out  with  respect 
to  the  other  two  types.  If  the  grouping  strength  is  not  too  high  and/or  the  diffusivity  is  not 
too  low,  then  it  can  cause  the  extinction  of  the  other  two  species,  due  to  the  reproduction 
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advantages  of  grouping,  see  Figures  20a)  and  b).  However,  if  the  diffusivity  is  too  low  to 
bring  the  grouping  type  sufficient  prey,  then  the  other  two  types  will  dominate,  while  the 
grouping  type  will  die  out  until  the  groups  are  sufficiently  small  to  survive,  see  Figures  20c) 
and  d).  If  the  grouping  species,  11,  is  introduced  at  low  values  to  the  other  two  types,  then 
it  will  slowly  take  over  until  it  reaches  a  critical  fraction,  /c,  at  which  the  other  two  species 
become  extinct  and  the  grouping  type  wins,  see  Figure  21. 


Figure  20:  Final  state  population  distributions  Zap  from  numerical  simulations  (a)  and  c)) 
shown  next  to  the  fractional  populations  /Q(g  over  time  (b)  and  d)).  In  a)  and  b),  the  species 
Z\\  causes  the  other  two  to  die  out,  due  to  the  advantages  of  grouping  for  reproduction. 
However,  if  the  diffusivity  is  low,  as  in  c)  and  d),  then  the  disadvantages  of  grouping,  i.e. 
the  fast  depletion  of  prey,  mean  that  Zqq  and  Zoi  do  much  better  than  Z n,  which  doesn’t 
completely  die  out. 

If  we  subject  the  system  to  background  stirring  motion,  such  as  Ray  Pierrehumbert’s 
exact  sin/sin  stirring[5],  see  Figure  22(1),  then  we  can  test  whether  grouping  is  still  advan¬ 
tageous  or  not,  see  Figures  22(ii)-(v).  It  can  be  seen  that,  at  low  grouping  parameters  and 
low  stirring,  the  grouping  type  dies  out,  but  at  large  grouping  parameters  and  large  stirring, 
the  grouping  type  wins.  In  other  regimes,  both  types  survive. 

Thus,  it  is  not  only  the  biological  but  the  dynamical  features  which  can  determine  how 
a  particular  species  will  fare. 


62 


Figure  21:  Frequency  of  Zap  versus  time,  showing  the  invasion  of  a  non-grouping  population, 
Z00,  by  the  grouping  type,  Zu,  taken  from  [7].  “Time  is  normalized  by  the  mutation 
timescale  . . .  The  new  allele,  introduced  at  low  frequency  at  t  =  0  does  not  affect  fitness 
until  the  frequency  of  grouping  organisms  (type  11)  reaches  fc,  at  which  part  it  starts  to 
aggregate.  ...  In  this  case,  social  behavior  is  a  successful  strategy:  the  grouping  type  wins 
the  competitions  and  the  other  types  become  extinct.  The  ecosystem  is  then  stable  with 
respect  to  mutations  in  the  two-allele  model.” 


II:  Grouping  Parameter 

Examples  of  how  diffusion  and  grouping  affect  the  dynamics  when  limit  cycles  may  be 
present  (through  setting  the  value  of  c  in  (24)),  can  be  seen  in  Figure  23. 

To  find  optimal  parameters,  we  let  populations  with  different  grouping  strengths  com¬ 
pete,  and  note  whether  their  populations  grow  or  decay.  The  effect  of  the  half  saturation 
ratios,  P 'h  and  c  in  equation  (24),  on  the  optimal  grouping  strength  can  be  seen  in  Figure  24. 

Ill:  Schooling 

As  before,  we  represent  schooling  as  velocity  alignment  with  neighbors  and  attraction  (ex¬ 
cept  very  nearby).  We  add  a  food  supply,  confined  to  the  mixed  layer  at  the  top  of  the 
ocean,  with  variability  produced  by  upwelling  which  brings  further  nutrients.  As  the  mixed 
layer  is  by  definition  assumed  well-mixed,  downwelling  does  not  change  the  concentration 
of  nutrients.  We  specify  a  streamfunction,  -0,  and  a  vertical  velocity,  w,  that  depend  on  the 
depth  of  the  thermocline,  h: 

I  9'  h  3h  fOR\ 

i/j  =  h,  w  =  (26) 

where  g'  is  the  modified  gravitational  acceleration  and  /  is  the  planetary  vorticity.  We  can 
then  derive  the  evolution  of  the  nutrient,  N: 

ON  w(w  >  0) 

1  R  }  ( Ndeep  —  N)  —  gNP  +  V.KfluidVN ,  (27) 

where  H  is  the  average  depth  of  the  thermocline,  Ndeev  is  the  nutrient  value  at  depth,  and 
P  is  the  population  of  the  autotroph,  e.g.  phytoplankton.  Example  biomass  distributions 
from  numerical  simulations  can  be  seen  in  Figure  25. 
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Figure  22:  (i)  depicts  the  form  of  the  Pierrehumberts  exact  sin/sin  stirring,  (ii)  —  (iv) 
show  snapshots  from  numerical  simulations  of  Z\  \  (left  panels)  and  a  passive  tracer  (right 
panels);  (ii)  and  (in)  have  no  diffusion,  i.e.  n  =  0,  (ii)  and  (iv)  are  subject  to  a  medium 
rate  of  stirring,  and  (Hi)  is  not  subject  to  any  stirring,  (iv)  shows  that,  while  diffusion 
eventually  completely  removes  any  gradients  in  the  tracer,  Z\\  still  remains  grouped,  (v) 
shows  a  summary  of  the  conditions  where  grouping  and  non-grouping  Z  survive,  from  [7]. 
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Figure  23:  Numerical  simulations  of  predator  biomass,  Z .  distribution  in  time  and  space, 
from  [6].  “Thin  black  contours  show  the  instantaneous  spatial  distribution,  at  fixed  inter¬ 
vals  in  time.  Thick  black  curve  shows  the  time  series  of  biomass  at  a  fixed  location,  a) 
Without  aggregation  behavior”  (wq  =  0),  “the  predator-prey  system  reaches  a  limit  cy¬ 
cle  after  a  period  of  transients;  the  simulation  shown  here  is  initialized  near  the  unstable 
coexistence  fixed  point,  b)  With  social  behavior”  (wq  =  2),  “and  fast  diffusion,  patches 
form  spontaneously  and  the  system  loses  its  oscillatory  behavior,  c)  With  social  behavior” 
(wo  =  15),  “and  slow  diffusion,  the  system  supports  a  regular  wave  traveling  through  the 
domain;  transients  are  not  shown,  d)  With  social  behavior”  (w o  =  5),  “and  slow  diffusion, 
the  density  field  can  appear  chaotic.” 
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a) 


b) 


Figure  24:  Optimal  strategy  vs  Ph  and  c,  from  [6].  x  is  non-dimensionalised  PyL .  A  is  non- 
dimensionalised  c,  and  S  is  our  w o-  “Black  symbols  indicate  optimal  parameter  values  of  [ico] 
from  numerical  simulations  of  the  competition  model,  solid  line  is  the  spline  interpolation. 
Gray  symbols  indicate  that  social  behavior  is  not  a  successful  strategy,  in  which  case  non¬ 
grouping  types  . . .  are  the  best  competitors.” 


We  model  foraging  by  assuming  that  the  turning  rate  of  an  organism  is  dependent  on 
the  difference  between  past  feeding,  £  compared  to  current  food  values.  Here  we  consider 
the  turning  rate  56  of  a  heterotroph,  Z,  in  terms  of  the  population  of  its  prey,  P.  The 
memory  of  food  evolves  as  follows: 

=  p{Xj)  _  & 

dt  P(Xi)  +  Ph  r’  1  j 

where  t  is  the  memory  period  of  the  heterotroph.  Then  we  model  the  turning  as 

56  =  1  +  tanh(a(&  -  P(X,:)),  (29) 

where  a  is  a  constant.  Figures  26  and  27  show  how  the  success  of  the  foraging  Z  depends 
on  how  fast  they  swim.  Foragers  that  are  too  speedy  over-shoot  the  largest  concentrations 
of  prey;  slow  foragers,  however,  don’t  find  the  prey  for  a  long  time.  Figure  28  shows  how 
successful  schooling  is  for  foragers  in  various  levels  of  turbulence  and  resources.  It  can  be 
seen  that,  for  low  levels  of  resources  there  is  no  clear  advantage  to  schooling,  and  it  can 
actually  be  a  disadvantage  at  low  levels  of  a,  the  ratio  of  alignment  to  attraction,  as  defined 
in  equation  (29).  However,  in  cases  with  medium  or  high  resources,  there  are  values  of  a 
that  provide  a  clear  advantage. 

Aggregation 

Observations  of  Antarctic  krill  swarms  can  be  seen  in  Figure  29.  Generally,  during  the  day 
there  tends  to  be  many  small  swarms  near  the  surface,  and  at  night  there  tend  to  be  a 
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Figure  25:  Figure  from  [6],  showing  results  from  numerical  simulations  with  schooling  and 
turbulent  advection.  a)  biomass  of  nutrients  N,  b)  biomass  of  phytoplankton  P.  “Lighter 
shades  of  gray  indicate  larger  values.  The  two  fields  tend  to  be  anti-correlated.” 
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b) 


Figure  26:  Figures  showing  numerical  simulations  of  foraging  Z  with  a)  fast  swimming 
speeds  and  b)  slow  speeds  at  various  non-dimensionalised  times.  The  white  dots  represent 
the  individual  Z,  the  black  tails  represent  the  organism’s  grazing  memory  and  the  color 
scale  represents  the  concentration  of  prey  P.  It  can  be  seen  that  the  fast  swimmers  over¬ 
shoot  and  never  all  end  up  concentrated  on  the  prey,  whereas  the  slow  swimmers  quickly 
group  over  the  high  concentration  of  prey. 
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Food  Intake 


Swimming  speed 


Figure  27:  Figure  summarizing  the  (non-dimensionalised)  food  intake  of  foraging  Z  with  a 
specific  swimming  speed  from  numerical  simulations.  Clearly  there  is  an  ideal  swimming 
speed  at  which  the  swimmers  find  the  food  within  a  reasonable  time  but  do  not  overshoot 
the  highest  concentration  by  swimming  too  fast. 


few  large  swarms  at  depths  (~250m).  Different  types  of  aggregation  can  be  classified  using 
a  measure  of  polarization:  the  degree  to  which  the  grouping  is  swarm-like  or  school-like. 
Figure  30  shows  results  from  numerical  simulations,  with  the  degree  of  polarization  plotted 
against  the  degree  of  alignment,  a,  and  the  swimming  velocity,  V .  Five  specific  grouping 
phases  are  identified  and  examples  shown  of  each. 


Conclusion 

It  can  be  seen  that  a  wide  range  of  grouping  structures  in  organisms  can  be  simply  mod¬ 
elled.  We  have  seen  that  in  turbulent  environments  it  is  sometimes  advantageous  to  school 
or  group,  depending  on  the  properties  of  the  organisms  themselves  as  well  as  how  the  envi¬ 
ronment  affects  their  supply  of  food.  In  general,  it  seems  that  there  is  the  need  for  a  certain 
amount  of  resource  availability  to  make  it  advantageous  for  foraging  predators  to  school. 
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more  resources 


Figure  28:  The  foraging  success  of  schooling  Z,  taken  from  [6],  “normalized  by  the  success  of 
random  walkers  under  the  same  conditions.  In  each  panel,  the  relative  success  ...  is  shown 
for  different  strategies  ( a ,  the  ratio  of  alignment  to  attraction” ,  see  equation  (29)  “varying 
between  0  and  1).  Each  column  corresponds  to  a  different  level  of  resource  availability;  each 
row  has  different  flow  characteristics.”  In  the  low  resource  panels,  there  is  no  advantage  to 
schooling,  and  in  fact  it  can  be  a  disadvantage  at  low  values  of  a.  However,  with  medium 
to  high  resources,  there  are  values  of  a  that  provide  a  clear  advantage. 
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Figure  29:  Observations  of  Antarctic  krill  from  acoustic  measurements  [3].  Points  represent 
aggregations,  which  are  plotted  as  a  function  of  the  time  of  day  of  observation  (x-axis)  and 
mean  depth  (y-axis).  The  size  and  color  represent  the  mean  density.  The  general  trend 
shows  a  few  patches  of  large  density  at  depth  at  night  and  many  patches  of  small  density 
near  the  surface  during  the  day. 
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Figure  30:  Simulations  results  from  [6].  The  gray  shading  indicates  the  degree  of  po¬ 
larization;  high  vales  (light  gray)  represents  schooled  groups,  low  values  (black)  represent 
swarming  groups.  This  is  plotted  against  a,  degree  of  alignment,  and  V,  swimming  velocity. 
Labels  I-V  indicate  distinct  phases:  I  milling;  II  swarms;  III  large  schools;  IV  small  schools; 
V  transient  alignment. 
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Lecture  6:  Mixing  in  the  presence  of  sources  and  sinks 


Jean-Luc  Thiffeault 

Notes  by  Sam  Pegler  and  Woosok  Moon 
28  June  2010 


1  Norms 


In  this  section  we  define  a  measure  of  mixing  that  does  not  necessarily  require  diffusion  to 
measure  the  amount  of  homogenization  that  occurs  during  the  mixing  process.  Recall  the 
advection-diffusion  equation 

c)0 

—  +  u-V9  =  kV29,  (1) 

where  6  is  a  concentration  field  in  a  finite  domain  U,  with  no-net-flux  boundary  conditions. 
We  assume  without  loss  of  generality  that 


9  dfi  =  0, 


and  define  the  L2-norm,  or  variance,  as 


(2) 


\\e\\l=  [  e2dn.  (3) 

Jn 

Recall  from  Lecture  1  that  the  variance  evolves  according  to 

±\\9\\l  =  -2K\\ve\\l  (4) 

and  decays  in  time  as  the  system  mixes.  The  variance  indicates  the  extent  to  which  the 
concentration  has  homogenized  and  is  thus  a  good  measure  of  the  amount  of  mixing  that 
has  occurred.  However,  the  variance  requires  knowledge  of  small  scales  in  6,  which  we  are 
not  necessarily  interested  in.  A  measure  of  how  well-mixed  the  concentration  is  does  not 
necessarily  require  knowledge  of  how  much  homogenization  has  occurred  due  to  diffusion  at 
small  scales.  This  is  more  in  keeping  with  the  definition  of  mixing  in  the  sense  of  ergodic 
theory  [2].  In  this  regard,  we  proceed  to  consider  the  pure  advection  equation 

^  +  u-  V6  =  0 
ot 

Note  that  in  this  case  equation  (4)  predicts  that  the  variance  satisfies 

>  11  =  0.  «» 


(5) 


74 


and  cannot  therefore  be  used  as  a  measure  of  mixing. 

The  advection  equation  (5)  takes  us  closer  to  the  ergodic  sense  of  mixing  in  which  we 
think  of  the  advection  due  to  the  velocity  field  as  a  time-dependent  operator  /S'*  :  T2  — >•  0 
that  moves  an  initial  patch  of  dye  according  to 

6q(x)  i->-  9(x,t)  =  St6o(x).  (7) 


If  we  consider  a  region  A  of  uniform  concentration  defined  by 


9q{x) 


l  if  x  e  A, 

0  otherwise, 


(8) 


then  the  volume  of  the  patch 

Vo\[6(x,t)]  =  Vo\(A),  (9) 

remains  constant  in  time  by  incompressibility.  We  can  associate  the  volume  of  the  patch 
with  the  Lebesgue  measure  and,  because  of  the  result  (9)  above,  St  is  measure-preserving. 
We  define  mixing  in  the  sense  of  ergodic  theory  by 

lim  Vol[A  n  S\B)}  =  Vol(A)Vol(R),  (10) 

f->-oo 

for  all  patches  A,  B  £  f l.  This  definition  follows  our  intuition  for  what  good  mixing  is. 
Referring  to  figure  1,  when  the  system  is  well-mixed  the  intersection  of  A  and  StB  is 
proportional  to  both  Vol(-A)  and  Vol(R).  Thus,  if  the  condition  (10)  holds  then  St  must 
spread  any  initial  patch  throughout  the  domain.  This  condition  is  referred  to  as  strong 
mixing  and  can  be  shown  to  imply  ergodicity. 

The  intersection  of  the  advected  patch  B  with  the  reference  patch  A  is  analogous  to 
projection  onto  L 2  functions.  This  motivates  the  following  weak  convergence  condition 


lim  (9(x,t),g)  =  0, 


(11) 


SlB 


Figure  1:  An  advected  patch  StB  that  has  undergone  strong  mixing.  At  late  times  the 
patch  covers  an  arbitrary  reference  patch  A. 
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for  all  functions  g  E  L2(D),  where  the  inner  product  is  defined  by 

(f,9)=  t  f(x)g(x)dn,  (12) 

Jn 

and  /  E  L2(Q )  if  fQ  |/|2  dfl  <  oo.  Weak  convergence  is  equivalent  to  mixing  as  a  conse¬ 
quence  of  the  Riemann-Lebesgue  lemma.  The  equivalent  conditions  (10)  and  (11)  require 
computing  over  all  patches  A  or  functions  g,  respectively.  Thus,  neither  of  these  conditions 
is  useful  in  practice.  However,  we  proceed  to  describe  a  theorem  that  shows  there  is  a 
simpler  way  to  determine  whether  or  not  weak  convergence  is  satisfied. 

Mathew,  Mezic  and  Petzold  [5]  introduced  the  mix-norm,  which  for  mean-zero  functions 
is  equivalent  to 

ll^llif-,/2  :=  1 1 V  1//26?  1 1 2  •  (13) 

Doering  and  Thiffeault  [1]  and  Lin,  Thiffeault  and  Doering  [3]  generalized  the  mix-norm  to 

P\\m--=  l|V90||2,  Q<  0,  (14) 

which  is  a  negative  homogeneous  Sobolev  norm.  This  norm  can  be  interpreted  for  negative 
q  via  eigenfunctions  of  the  Laplacian  operator.  For  example,  in  a  periodic  domain,  we  have 

ll«ll|,=£WW,  (15) 

k 

from  which  we  see  that,  for  q  <  0,  ||0||7.  smooths  6  before  taking  the  L2  norm.  The  theorem 
lim  ||0||jj9  =0,  q  <  0  <==>•  6  converges  weakly  to  0,  (16) 


Figure  2:  Comparison  of  the  mix- norms  for  a  flow  optimized  using  the  separate  methods  of 
optimal  control  and  optimal  instantaneous  decay.  Figure  from  Lin  et  al.  [3]. 
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t  =  0.6 


t  =  0.8 


Figure  3:  Evolution  of  the  concentration  field  for  the  flow  optimized  in  the  case  q  =  —  1  as 
computed  by  Lin  et  al.  [3]. 


due  to  Mathew,  Mezic  and  Petzold  [5]  and  Doering,  Lin  and  Thiffeault  [3]  shows  that  we 
can  track  any  mix- norm  to  determine  whether  a  system  is  mixing  (in  the  weak  sense). 
The  existence  of  this  quadratic  norm  makes  optimization  of  the  velocity  field  for  good 
mixing  considerably  easier.  Mathew,  Mezic,  Grivopoulos,  Vaidya  and  Petzold  [4]  have 
used  optimal  control  to  optimize  the  decay  of  the  q  =  —1/2  mix- norm.  Lin,  Doering  and 
Thiffeault  [3]  have  optimized  the  instantaneous  decay  rate  of  the  q  =  —  1  norm  using  the 
method  of  steepest  descent,  which  is  easier  to  compute  numerically  but  yields  suboptimal, 
but  nevertheless  very  effective,  stirring  velocity  fields.  A  comparison  of  the  methods  for 
optimized  mixing  is  shown  in  figure  2.  The  solid  line  decays  faster,  but  this  is  merely 
because  the  H cannot  be  compared  directly  with  FT-1/2.  The  corresponding  evolution  of 
the  concentration  field  for  the  case  q  =  —  1  from  Lin  et  al.  [3]  is  shown  in  figure  3. 
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2  Sources  and  Sinks 


Now  let  us  consider  the  situation  with  a  sources-sink  term  s(x,t), 

dt9  +  u-V9  =  nV2e  +  s{x,t),  (17) 

V  •  u  =  0. 

For  simplicity,  assume  that  f n  s(x,t)d£l  =  0.  Otherwise,  we  can  subtract  the  mean  of  9.  It 
is  convenient  to  think  of  sources  and  sinks  as  hot  and  cold  regions. 

Let  us  also  assume  that  our  sources  and  sinks  are  time- independent.  Then,  the  system 
eventually  achieves  a  steady  state  6(x)  that  satisfies 


u-V9  =  kV29  +  s. 

(18) 

We  define  the  operator 

C  =  y  •  V  —  kV2 

(19) 

so  that  (18)  can  be  written 

C9  =  s. 

(20) 

The  steady  solution  is  then 

9  =  C-ls 

(21) 

where  the  mean-zero  condition  on  9  makes  this  unique.  Note  that  k  /  0  is  needed  to 
achieve  a  steady  state.  So,  assuming  the  system  has  reached  a  steady-state,  we  have  to 
determine  how  we  measure  the  quality  of  mixing.  One  of  the  possible  ways  is  to  look  at 
the  norms  \\9 \\fjq,  where  q  =  0  represents  standard  derivation.  But  we  have  to  decide  what 
we  will  compare  to.  One  possibility  is  \\6 \\jjq/  |s \\jjq-  This  ratio  is  a  reasonable  choice,  but 
has  units  of  inverse  time.  It  is  preferable  to  use  a  dimensionless  quantity  for  measuring  the 
quality  of  mixing.  In  this  spirit,  we  define  mixing  enhancement  factors: 

\\d\\w 
q  P\W 

(22) 

where  9  is  the  purely-diffusive  solution  which  satisfies 

C9  =  s. 

Here,  £  =  — /-cV2  is  the  pure  diffusive  operator,  so  9  can  be  interpreted  as  the  solution  in  the 
absence  of  stirring.  Since  ||0||^9  is  usually  decreased  by  stirring,  eq  measures  the  enhance¬ 
ment  over  the  pure-diffusion  state.  Several  properties  are  given  in  Doering  and  Thiffeault  [1], 
Shaw,  Thiffeault  and  Doering  [7],  and  Thiffeault  and  Pavliotis  [8].  We  interpret  a  large  eq 
as  ‘good  stirring,’  since  it  in  that  case  the  norm  is  decreased  by  stirring. 

A  natural  question  is  whether  eq  can  ever  be  less  than  unity,  that  is,  if  stirring  can  ever 
be  worse  than  not  stirring.  Let’s  consider 


l|V0||2 
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Here, 

0  =  C~ls  =  (— kV2)  1  s  =  — k-1V-2s  ^  V0  =  -k^V-'s 
Also,  from  CO  =  s,  we  can  multiply  0  on  both  sides  and  take  spatial  average  and  then  get 

(0C0)  =  ( sO ) , 

where  (•)  =  fn  ■  dfl.  We  expand  the  left-hand  side: 

(0C0)  =  (Ou-VO)  -  n(0V20) 

=  (V-  (u0'2/2))  -  k(0V20) 

=  -k(0V20)  =  K(\V0\2). 

As  for  the  right-hand  side, it  can  be  written  as 

(Os)  =  (ov  •  v_1s>  =  -  (vo  •  v_1s>  =  k  (vo  ■  vo} , 

where  we  used 

0  =  C~1s  =  (-kV2)_1  s  =  -/e_1V-2s 
<=►  VO  =  -K-'V-'s. 

Recall  that  (|V0|2)  =  ll^ll2^.  Therefore, 

11^11^1  =  (vO-Vd)  <  j|V0||2||V0||2  =  11011^11011^. 

We  conclude  that 

II^IIri  <  Mfr  ^  S1  <1.  (23) 

This  is  somewhat  counter-intuitive  because  gradients  are  usually  increased  by  stirring.  How¬ 
ever,  the  gradients  in  a  steady-state  have  been  affected  by  diffusion. 

What  about  eq  for  values  of  q  other  than  1?  We  tried  and  failed  to  prove  eq  <  1,  simply 

because  it  is  not  true.  Following  a  challenge  by  Charlie  Doering  at  a  workshop  at  the  IMA 

in  2010,  Jeff  Weiss  came  up  with  something  like: 

u  =  (2  sin  x  cos  2 y  ,  —  cos  x  sin  2 y) ,  (24a) 

s  =  (cos  a:  —  ^)  siny  .  (24b) 

This  velocity  field  manages  to  concentrate  the  source  and  sink  distribution  more  than  dif¬ 
fusion  alone.  Streamlines  of  u  and  level  sets  of  s  are  shown  in  figure  4.  In  this  example,  we 
could  get  £o  —  0.978  and  e_i  ~  0.945,  which  are  slightly  less  than  1.  It  is  an  open  problem 
to  characterize  such  ‘unmixing’  flows. 
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(a) 


(b) 


x  x 

Figure  4:  The  pattern  of  velocity  field  and  source  for  the  ‘unmixing'  flow  and  source  distri¬ 
bution  (24). 

3  Optimization 

We  defined  the  mixing  enhancement  factors  based  on  Sobolev  norms.  Large  mixing  en¬ 
hancement  factor  indicates  good  mixing  for  a  given  source  and  sink  pattern.  One  of  the 
relevant  questions  in  this  step  is  what  kinds  of  flow  give  the  largest  eq  given  source  and  sink 
distribution  s(x). 

Here  is  a  simple  but  surprising  example.  The  source  and  sink  distribution  is  given 
by  s(x)  =  sinx  with  periodic  boundary  conditions  on  6.  The  optimal  solution  for  this 
source  and  sink  distribution  is  u  =  Ux,  which  is  constant  flow  from  the  hot  region  to  the 
cold  region  [6-8]  (figure  5).  This  example  demonstrates  that,  with  body  sources,  the  best 


Figure  5:  The  optimal  velocity  held  (solid  arrows)  for  the  source  distribution  s(x)  =  sinx. 
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Figure  6:  Optimal  stirring  velocity  field  (solid  lines)  for  the  source  s(x)  =  sin  x  sin  y  (colored 
background),  for  q  =  0  (left)  and  q  =  —  1  (right).  The  optimal  velocity  is  the  same  in 
both  cases  because  the  source  is  an  eigenfunction  of  the  Laplacian.  (Matlab  programs 
example  (1)  and  example  (2)  in  the  Appendix.) 


stirring  often  has  more  to  do  with  transport  than  with  creation  of  small  scales. 

More  generally,  we  have  to  solve  the  optimization  problem  numerically.  Figure  6  shows 
contours  of  the  streamfunction  for  the  optimal  stirring  velocity  (lines)  for  a  source  s(x)  = 
sin  x  sin  y,  for  q  =  0  and  q  =  —  1.  The  optimal  velocity  fields  are  identical  for  the  two  values 
of  q,  because  the  source  is  an  eigenfunction  of  the  Laplacian. 

Contrast  this  to  the  optimal  solutions  in  figure  7,  for  the  source  distribution  s(»)  = 
cos  x  cos  y  +  cos  3 y  +  (1/4)  sin3y.  This  source  is  not  an  eigenfunction  of  the  Laplacian,  and 
we  expect  optimal  solutions  to  depend  on  q.  Comparing  the  left  (q  =  0)  and  right  ( q  =  —  1) 
figures,  we  see  this  is  indeed  the  case,  though  the  difference  in  this  case  is  fairly  small. 

Finally,  given  an  optimization  code,  it  is  simple  to  turn  it  around  to  anti-optimize,  that 
is,  find  the  worst  stirring  velocity  for  a  given  source  distribution.  Figure  8  shows  this  for 
the  source  (24b)  and  q  =  0.  Note  how  the  velocity  field  seems  to  work  to  concentrate  the 
source  sink,  thereby  increasing  the  variance.  The  efficiency  for  this  anti-optimal  solution 
is  £q  =  0.9736,  which  is  not  much  lower  than  Jeff  Weiss’s  unoptimized  flow  (24a),  which 
had  £o  —  0.978. 

To  reproduce  the  5  figures  in  this  section  run  the  program  example(n)  in  the  Appendix, 
where  n  is  a  number  from  1  to  5. 


Appendix:  Matlab  code 

1  Program  file  example. m 

function  example (ex) 

7.  util  folder  contains  Diffmat2.m,  fourdif.m,  refine.m,  refine2.m,  refinek.m 
7.  Contact  author  to  obtain  these  functions, 
addpath  util 

if  nargin  <  1 ,  ex  =  1 ;  end 
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Figure  7:  Optimal  stirring  velocity  field  (solid  lines)  for  the  source  cos  x  cos  y  +  cos  3 y  + 
(1/4)  sin3y  (colored  background),  for  q  =  0  (left)  and  q  =  —  1  (right).  The  optimal  velocities 
are  different  since  the  source  is  not  an  eigenfunction  of  the  Laplacian.  (Matlab  programs 
example  (3)  and  example  (4)  in  the  Appendix.) 


I  1.1  1.1  1.1  M  1.1  1.1  1.1  1.1  1.1 


Figure  8:  Optimal  ‘unmixing’  solution  for  the  source  (cosx  —  sin  y,  with  mixing  effi¬ 
ciency  £q  =  0.9736.  (Matlab  program  example  (5)  in  the  Appendix.) 
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N  =  11;  °/0  Number  of  gridpoints 
L  =  1;  kl  =  2*pi/L;  #/«  Physical  size  of  domain 
x  =  L*(0:N-1)/N;  y  =  x’ ;  [xx,yy]  =  meshgrid(x,y) ; 

switch  ex 

•/.  Examples  1  and  2  use  the  same  cellular  source,  for  q=0  and  q=-l. 

°/0  Since  the  source  is  an  eigenfuntion  of  the  Laplacian,  the  optimal 
°/e  flow  is  the  same  for  q=-l. 
case  {1,2} 

src  =  cos (kl*xx) . *cos (kl*yy)  *  2/L; 

psiO  =  sin(kl*xx) . *sin(kl*yy)  *  l/sqrt(2)/pi; 

kappa  =  .1;  q  =  1-ex; 

l  Examples  3  and  4  use  the  same  two-mode  source,  for  q=0  and  q=-l. 

I  Since  the  source  is  not  an  eigenfuntion  of  the  Laplacian,  so  the  optimal 
1  flow  is  different  for  q=-l. 
case  {3,4} 

src  =  (cos(kl*xx) . *cos(kl*yy)+cos(3*kl*yy)+. 25*sin(3*kl*yy) )  *  4*sqrt (2) /5/L 
psiO  =  sin(kl*xx) . *sin(kl*yy)  *  l/sqrt(2)/pi; 
kappa  =  .1;  q  =  3-ex; 
case  5 

%  Unmixing  solution 

src  =  (cos(kl*xx)  -  . 5) . *sin(kl*yy)  *  2*sqrt (2/3)/L; 
psiO  =  sin(kl*xx) . *sin(2*kl*yy)  *  1/sqrt (5) /pi ; 

scalefac  =  -N~2;  °/0  Set  scale  factor  negative  to  minimize  instead 
kappa  =1/4;  q  =  0; 
end 

if  ~exist( ’scalefac’ ) ,  scalefac  =  N~2;  end 
[psi,Effq]  =  velopt (psiO, src, kappa, q,L, scalefac) ; 
fprintf  (1,  ’Ef  f  _°/od=0/0f  \n’  ,q,Effq) 
figure (1) 

Nplot  =  64;  l  Interpolate  solution  for  plotting 
psir  =  ref ine2(psi , Nplot) ;  srcr  =  ref ine2(src, Nplot) ; 
xplot  =  L* (0:Nplot-l) /Nplot ;  yplot  =  xplot’; 
imagesc (xplot ,yplot , srcr) ,  colorbar,  hold  on 
contour (xplot , yplot ,psir, 10, ’EdgeColor’ , ’k’ ) ,  hold  off 


2  Program  file  velopt. m 

function  [psi,Effq]  =  velopt (psiO , src , kappa, q,L, scalefac) 

°/«  Problem  parameters  for  Matlab’s  optimizer  fmincon. 
psiO  =  psi0(:);  problem. xO  =  psiO (2 rend); 

problem. objective  =  @(x)  normHq2(x, src, kappa, q,L, scalefac) ; 
problem. nonicon  =  @(x)  nonlcon(x, src, kappa, q,L, scalefac) ; 
problem. solver  =  ’fmincon’; 

problem. options  =  optimset ( ’Display ’,’ iter ’, ’TolFun’ , le-10 ,.. . 

’GradObj ’ , ’on’ , ’GradConstr’ , ’on’ , . . . 

’ algorithm ’ , ’ interior-point ’ ) ; 

[psi,Hq2]  =  fmincon (problem) ; 

1  Mixing  efficiency:  call  normHq2  with  no  flow  to  get  pure-conduction  solution 
Effq  =  sqrt (normHq2 (zeros (size (psi) ) , src, kappa, q,L, scalefac)  /  Hq2) ; 

psi  =  reshape  (  [0;  psi]  ,  size  (src))  ;  °/«  Convert  psi  back  into  a  square  grid 


%========================================================= 

function  [varargout]  =  normHq2 (psi , src , kappa, q,L , scalefac) 

N  =  size(src,l);  src  =  src(r); 
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7,  2D  Differentiation  matrices  and  negative-Laplacian 
[Dx,Dy ,Dxx,Dyy]  =  Dif fmat2(N,L) ;  mlap  =  -(Dxx+Dyy); 

if  q  ~=  0  &&  q  ~=  -1,  error(’This  code  only  supports  q  =  0  or  -I.’);  end 


psi  =  [0;psi] ;  ux  =  Dy*psi;  uy  =  -Dx*psi; 

ugradop  =  diag ( sparse (ux) ) *Dx  +  diag ( sparse (uy) ) *Dy; 


if  q  ==  0 

Aop2  =  (-ugradop  +  kappa*mlap) ; 
elseif  q  ==  -1 

Aop2  =  mlap* (-ugradop  +  kappa*mlap) ; 
end 

Aopl  =  (ugradop  +  kappa*mlap) *Aop2 ; 

7,  Solve  for  chi,  dropping  corner  point  to  fix  normalisation, 
chi  =  [0;  Aopl (2 : end, 2 : end)  \  src(2:end)]; 
theta  =  Aop2*chi; 

7,  The  squared  H~q  norm  of  theta. 

varargout{l}  =  L'‘2*sum (theta.  "2) /N~2  *  scalefac; 

if  nargout  >  1 

7.  Gradient  of  squared-norm  Hq2. 

gradHq2  =  2* ( (Dx*theta) . * (Dy*chi)  -  (Dy*theta) . * (Dx*chi) ) ; 
varargout{2}  =  gradHq2(2 : end)  /  N~2  *  scalefac; 
end 


%============================================================= 

function  [c , ceq,gc ,gceq]  =  nonlcon(psi , src , kappa, q,L, scalefac) 

psi  =  [0;psi] ;  N  =  size(src,l); 
c  =  []  ;  gc  =  []  ; 

[Dx,Dy ,Dxx,Dyy]  =  Dif fmat2(N,L)  ;  7«  2D  Differentiation  matrices 
U2  =  L~2* (sum( (Dx*psi) . ~2  +  (Dy*psi)  .  ~2) /N~2)  ; 
ceq(l)  =  (U2-1)  *  scalefac; 

if  nargout  >  2 

7.  Gradient  of  constraints 
mlappsi  =  - (Dxx+Dyy) *psi; 

gceq(:,l)  =  2*mlappsi (2 : end)  /  N~2  *  scalefac; 
end 
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Plankton  dynamics  and  mesoscale  turbulence 

Lecture  given  on  29  June  2010  by  A.  Provenzale 


In  aquatic  ecosystems,  primary  production  (the  transformation  of  inorganic  materials  and 
light  into  living  matter  by  photosynthesis)  is  operated  mainly  by  small,  unicellular  algae 
that  float  freely  in  the  upper  layers  of  oceans  and  lakes  and  are  collectively  called  phytoplank¬ 
ton ,  see  for  an  illustration  the  phytoplankters  depicted  in  figure  1.  Since  phytoplankton 
need  light,  they  are  confined  to  the  water  layer  where  solar  radiation  can  penetrate.  This 
region  is  called  the  “euphotic  layer”  and  it  has  a  maximum  depth  of  about  one  hundred 
meters.  In  the  ocean,  this  roughly  corresponds  to  the  depth  of  the  mixed  layer;  thus,  the 
environment  where  phytoplankton  live  is  a  highly  energetic  fluid  region  characterized  by 
intense  turbulent  mixing. 


Figure  1:  Some  examples  of  phytoplankton  cells 


Aquatic  ecosystems  are  characterized  by  the  essential  role  played  by  fluid  dynamics. 
The  small  organisms  which  compose  the  plankton  are  advected  by  the  surrounding  flow 
and  must  cope  with  environmental  currents,  turbulence,  and  waves.  And  those  organisms 
which  anchor  themselves  to  the  rocks  and  to  the  bottom  must  prevent  being  wiped  away  by 
turbulent  bursts.  Fluid  dynamics  determines  the  motion  of  the  individual  organisms  and,  on 
a  larger  scale,  the  distribution  and  concentration  of  entire  populations.  Fluid  dynamics  and 
turbulence  enter  marine  and  lacustrine  Ecology  at  all  levels,  and  must  be  properly  studied 
and  understood  in  order  to  obtain  a  correct  description  of  aquatic  ecosystem  functioning. 
On  the  other  side,  the  transport  and  advection  of  a  biologically  reactive  tracer,  such  as 
the  plankton  distribution,  or  of  individual  small  living  “impurities”  (as  we  could  call  them) 
in  a  turbulent  flow  make  fluid  dynamics  even  more  intriguing,  opening  up  new  problems 
and  challenges  which  can  stimulate  the  interest  of  the  more  mathematically-oriented  types. 
In  these  three  lectures,  we  shall  touch  upon  some  of  these  issues,  with  no  attempt  at 
completeness  but  rather  barely  scratching  the  surface  of  this  topic. 
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1  Introduction 


Plankton  distributions  in  the  ocean  and  lakes  are  highly  inhomogeneous.  One  of  the  reasons 
for  this  inhomogeneity  lies  in  the  presence  of  strong  horizontal  advection,  associated  with 
mesoscale  turbulence,  coupled  with  the  spatial  inhomogeneity  of  the  nutrient  input  and  the 
nonlinear  dynamics  of  plankton  populations. 

Phytoplankton  are  ‘primary  producers’,  which  means  that  they  take  inorganic  materials, 
such  as  nitrogen  and  carbon,  and  convert  them  into  biomass  via  photosynthesis.  The  main 
limiting  factors  for  phytoplankton  growth  are  light  and  nutrient  availability;  for  this  reason, 
phytoplankton  populations  are  confined  to  the  upper  layers  of  lakes  and  oceans,  in  a  region 
known  as  the  euphotic  layer,  see  figure  1.  In  the  ocean,  nutrients  are  supplied  by  different 
mechanisms:  (1)  upwelling  of  deeper  waters,  the  most  crucial  process  as  the  majority  of 
nutrients  are  generated  by  bacterial  activity  at  depth;  (2)  direct  bacterial  regeneration  of 
nutrients  from  dead  biomass  and  excreted  organic  substances  in  the  euphotic  layer;  and 
(3)  dust  deposition  on  the  surface.  As  photosynthesis  becomes  less  efficient  when  the  solar 
radiation  is  too  intense,  and  since  most  nutrients  come  from  deeper  waters,  phytoplankton 
tend  to  live  near  the  middle  of  the  euphotic  layer. 


Phytop. 

pop 


Nutrients 


Figure  2:  The  structure  of  the  upper  ocean,  as  modeled  in  this  lecture.  Phytoplankton 
populations  are  confined  to  the  euphotic  layer  (turquoise),  where  there  is  enough  sunlight 
for  photosynthesis.  Nutrients  are  supplied  mainly  by  upwelling  of  deeper  waters  into  the 
euphotic  layer,  as  the  highest  concentration  of  nutrients  is  at  depth,  where  sea  snow  tends  to 
settle  and  bacterial  activity  is  strong.  Too  much  sunlight  is  detrimental,  and  phytoplankton 
populations  tend  to  be  found  near  the  middle  of  the  euphotic  layer. 


We  model  meso-  and  large-scale  interactions  between  fluid  dynamics  and  biology  by 
resorting  to  reaction-advection-diffusion  equations.  The  reaction  terms  represent  biologi¬ 
cal  interactions.  The  advection  terms  here  represent  horizontal  advection,  mainly  due  to 
mesoscale  circulations  and  flows.  The  diffusion  terms  are  a  parametrization  of  small-scale 
turbulence  and  of  vertical  mixing  due  to  up-  and  down-welling. 
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2  React  ion- Diffusion  Equations 

Reaction-diffusion  equations  have  the  form 

^  =  f  (P)  +  DV2P,  (1) 

where  p  is  the  concentration  field  of  some  (vector)  quantity,  and  D  is  diffusivity  (assumed 
constant  and  the  same  for  all  field  components).  The  vector  quantity  p  represents  the  con¬ 
centration  of  different  scalar  fields  undergoing  chemical  and/or  biological  interactions.  The 
vector  quantity  f  (p)  represents  the  reactions  between  the  various  components  of  p. 


In  one  spatial  dimension  and  for  a  scalar  field,  this  equation  becomes 


dp 

dt 


f(p)  +  D 


d2p 
dx 2  ’ 


where  x  is  the  spatial  coordinate  and  now  p  is  a  scalar. 


(2) 


The  simplest  biological  example  is  the  KISS  model  for  plankton  blooms  [7,  17].  This 
includes  a  term  describing  Malthusian  (exponential)  growth  and  a  diffusion  term, 


dp 

dt 


vp  +  D 


d2p 
dx 2  ’ 


(3) 


where  v  is  the  exponential  growth  rate  of  a  spatially  homogeneous  population.  Via  Fourier 
analysis,  we  can  find  the  following  expression: 


1 

p(x,t)  = —  /  p(k)  exp(ffex)  exp[(zv  —  Dk2)t]  dk,  (4) 

2vr  J_ oo 


and  so  there  is  instability  (growth)  for  k2  <  v/D  (large  spatial  scales),  while  growth  is 
restrained  at  small  scales.  This  behavior  leads  to  the  presence  of  patchiness  in  the  concen¬ 
tration  field  p  with  the  minimum  scale  2ir(D /v)1^2  ■ 


To  saturate  the  exponential  growth  of  the  homogeneous  population  at  large  times,  one 
can  include  an  extra  term  in  p2,  and  obtain  the  celebrated  Fisher  equation: 


dp 


dt 


P 


-£  =  "P  l-£  +D^, 


K 


d2P 


dx2 ' 


(5) 


where  K  is  a  constant.  We  can  make  this  equation  non  dimensional  by  rescaling  space, 
time  and  the  concentration  field: 


P  , 

P=W  ‘ 


t,  =  1st,  x  — ►  X  = 


v  \  V2 
DJ  ' 
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and  we  obtain,  omitting  the  tilde  for  ease  of  notation, 

dP  (,  x  ,  d2p 

M=p{1-p)  +  W  <6) 

where  all  quantities  are  now  non-dimensional.  The  homogeneous  version  of  this  equation, 
for  dp/dx  =  0,  is  known  as  the  logistic  equation, 

t=pil~p)-  <7) 

The  Fisher  equation  admits  a  traveling  wave  solution  of  the  form  U(z )  =  p{x  —  ct ), 
which  satisfies  the  following  equation: 

?E  +  cW+Uil-U)  =  0,  (8) 

that  is, 

%  =  v'  f =-cv-^-t0  (») 

where  (U,V)  =  ( U,dU/dz ).  This  system  of  ordinary  differential  equations  defines  a  two- 
dimensional  dynamical  system,  which  has  two  fixed  points,  namely  (U,  V )  =  (0,  0)  and 
(U,  V)  =  (1,0).  Linear  stability  analysis  of  the  fixed  points  gives  the  eigenvalues  A±: 


(0,0) 

A±  =  i  -c±  (c2  -  4)1/2  , 

(10) 

(1,0)  - 

->  A±  =  ^  -c±  (c2  +  4)1/2  , 

Thus,  (0,0)  is  a  stable  node  if  c2  >  4,  and  a  stable  focus  if  c2  <  4;  the  fixed  point  (1,0) 
is  a  saddle.  A  stable  focus  would  result  in  negative  values  of  U  on  some  trajectories  near 
(0,  0),  and  thus  c2  >  4  is  the  only  physical  solution.  The  corresponding  phase-plane  picture 
is  shown  in  figure  3,  along  with  a  depiction  of  the  form  of  the  traveling  wave  solution  (from 
[10]). 

For  general  initial  conditions  of  the  form 

p(x,  0)  =  1,  X  <  X\\  p(x,  0)  =  0,  X  >  X2,  (11) 

where  p(x \  <  x  <  X2,0)  is  an  arbitrary  function  which  joins  1  and  0,  at  large  time  the 
solution  will  tend  to  the  traveling  wave  solution,  U(x  —  ct)  with  c  =  2,  see  figure  4. 

If  we  replace  the  second  initial  condition  as  follows 

p(x,  0)  «  Aexp(— ax),  x  — *  oo,  (12) 

see  figure  4,  then  the  system  tends  to  the  solution 

p(x,  t)  «  Aexp(— a[x  —  ct]),  x  — >  oo,  (13) 

where 

c  =  a- 1 —  if  0  <  a  <  1;  c  =  2  if  a  >  1. 
a 
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Figure  3:  Panel  (a)  shows  the  phase  plane  trajectories  for  the  dynamical  system  (9),  rep¬ 
resenting  the  traveling  wave  solution  of  the  Fisher  equation  (6),  and  panel  (b)  shows  the 
traveling  wave  solution  of  equation  (6),  for  c  >  2.  Both  illustrations  are  from  [10]. 


Figure  4:  Qualitative  description  of  the  initial  conditions  which  lead  to  a  traveling  wave 
solution,  U(x  —  ct ),  of  the  reaction-diffusion  equation  (6),  where  panel  (a)  shows  (11)  and 
panel  (b)  shows  (12).  The  linear  form  of  the  initial  conditions  for  x\  <  x  <  x 2  adopted  here 
is  for  illustration  purposes  only. 
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3  Reaction- Advection-Diffusion  Equations  and  Plankton  Dy¬ 

namics 

In  a  fluid  flow,  reactions  are  affected  not  only  by  diffusion,  but  also  by  advection.  In  such 
conditions,  the  equations  for  the  concentrations  of  reactive  tracers  become 

^  +  u-Vp  =  f(p)  +  DV2p,  (14) 

where  u  is  the  advecting  flow  field  and  D  is  the  diffusivity  (again  assumed  constant  and  the 
same  for  all  components  of  the  concentration  field). 

Often,  the  fluid  velocity  u  is  turbulent,  and  this  is  certainly  true  for  the  case  of  large 
and  mesoscale  ocean  flows.  Turbulent  advection  in  aquatic  ecosystems  occurs  on  several 
scales.  Some  of  these  are: 

Micro-scales:  Between  about  1  mm  and  a  few  meters,  the  interactions  of  3-D  turbulence, 
buoyancy  and  individual  plankton  cells  are  important,  as  will  be  discussed  in  the 
following  lecture. 

Meso-  and  submeso-scales:  Between  about  1  km  and  200km,  largely  in  the  form  of 
fronts  and  vortices,  possibly  associated  with  strong  vertical  velocities,  which  modulate 
the  plankton  distribution. 

Large  scales:  Transport  at  ocean  basin  scales  by  large-scale  gyres. 

In  the  ocean,  mesoscale  advection  is  highly  inhomogeneous,  with  very  localized  upwelling 
regions  leading  to  inhomogeneous  nutrient  fluxes;  figure  3  shows  examples  of  mesoscale 
structures  in  the  ocean  dynamical  properties  and  in  the  plankton  distribution.  In  general, 
horizontal  advection  is  associated  with  much  larger  velocities  than  vertical  advection,  and  is 
responsible  for  transporting  nutrients,  phytoplankton  and  zooplankton  as  (almost)  passive 
tracers.  Advection  timescales  in  the  upper  ocean  are  on  the  order  of  1-7  days,  which  are 
similar  to  the  phytoplankton  and  zooplankton  growth  timescales,  which  are  of  the  order  of 
1-2  days  and  1-2  weeks,  respectively.  This  means  that  scale  separation  between  the  reaction 
and  the  advection  time  scales  (which  could  allow  for  some  simplifications)  is  not  possible. 

In  keeping  with  the  tradition  of  considering  idealized  situations,  we  introduce  a  simpli¬ 
fied  description  of  mesoscale  turbulence  in  the  ocean.  Mesoscale  ocean  flows  are  dominated 
by  rotation  and  are  usually  characterized  by  stable  stratification  (ignoring  the  localized 
regions  of  intense  deep  convection  in  some  polar  areas  and  in  the  Gulf  of  Lyons).  In  such 
conditions,  one  of  the  simplest  (certainly  too  simple)  descriptions  of  mesoscale  ocean  flows 
is  provided  by  quasigeostrophic  turbulence  in  its  barotropic  version,  and,  discarding  the 
effects  of  the  free  surface  and  of  differential  rotation  (the  /3-plane  effect),  by  the  formulation 
known  as  two-dimensional  (2D)  turbulence.  Of  course,  this  flow  model  cannot  be  taken 
as  a  true  representation  of  mesoscale  ocean  flows  but  rather  as  an  idealized  picture  that 
has  some  of  the  properties  of  the  real  flow  (for  example,  the  presence  of  intense  coherent 
vortices) . 
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Figure  5:  Examples  of  mesoscale  structures  in  plankton  populations  and  in  the  ocean  tem¬ 
perature  and  velocity  fields.  Panel  (a)  shows  the  concentration  of  chlorophyll-a  around 
Tasmania  [11].  Panel  (b)  shows  a  phytoplankton  bloom  of  Nodularia  spumigena  in  the 
Baltic  sea  [12].  Panel  (c)  shows  the  intricate  mesoscale  structures  in  sea-surface  temper¬ 
ature  in  the  North  Atlantic  [13].  Panel  (d)  shows  the  velocity  field  (arrows)  and  the  sea 
surface  height  variations  (colorscale)  in  a  patch  of  the  Mediterranean  Sea,  near  the  coast  of 
France  and  Italy  [2]. 
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(15) 


The  equation  of  motion  for  2D  turbulence  is  written  as 

d(  d(  d( 

777  +  — I"  v^r  —  D  +  F, 

at  ox  ay 

where  D  is  a  (hyper)diffusion  term,  F  is  forcing,  Q  is  vorticity, 

.  dv  du  _ n  , 

"  7h-  Thj  v' 


(16) 


and  ^  is  the  stream  function.  The  horizontal  velocities  are  given  by  u  =  —d'tjj/dy  and 
v  =  dijj/dx.  Many  theoretical  and  numerical  explorations  of  this  system  have  shown  that  a 
random  initial  vorticity  field  organizes  into  a  collection  of  coherent  vortex  structures  which 
contain  most  of  the  kinetic  energy  and  enstrophy  (squared  vorticity)  of  the  flow  (see  [9] 
for  a  pioneering  work  on  this  subject).  Figure  6  shows  an  example  of  the  vorticity  field 
produced  by  numerical  integration,  with  a  pseudo-spectral  code,  of  the  forced  and  dissi¬ 
pated  2D  turbulence  equation  (15),  see  [14]  for  further  details.  Here  we  use  hypeviscosity 
at  small  scales  and  large-scale  dissipation  to  balance  the  forcing,  obtained  by  keeping  fixed 
the  power  spectrum  at  a  given  wavenumber  kf.  After  a  transient,  the  flow  settles  into  a 
statistically  stationary  state  where  the  average  energy,  enstrophy  and  number  of  coherent 
vortices  oscillate  around  a  constant  value. 


Figure  6:  Vorticity  field,  £,  from  the  numerical  simulation  of  equation  (15)  with  forcing  and 
dissipation,  showing  the  presence  of  intense  coherent  vortices. 


4  The  role  of  localized  upwelling  regions 

Plankton  growth  depends  on  the  availability  of  nutrients,  which  have  to  be  supplied  to  the 
oceanic  mixed  layer  to  balance  biological  consumption.  In  turn,  the  nutrient  input  fluxes 
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Figure  7:  Green  regions  indicate  where  the  intense  nutrient  flux  takes  place.  The  left  panel 
refers  to  the  case  with  a  single  region  of  strong  upwelling  (RF1),  while  the  right  panel  shows 
a  snapshot  of  the  case  where  the  active  regions  are  fragmented  (RFn,  here  n=13).  The  total 
area  of  the  regions  with  strong  upwelling  is  approximately  the  same  in  the  two  cases. 


are  affected  by  different  physical  mechanisms  such  as  turbulent  mixing,  Ekman  pumping, 
convection,  transport  from  the  boundaries  etc.  Such  mechanisms  act  on  a  variety  of  scales 
providing  spatially  and  temporally  localized  fluxes  of  nutrients  to  the  biological  system. 
Thus,  it  is  useful  to  understand  the  behavior  of  a  biological  model  in  response  to  such  in¬ 
homogeneous  and  intermittent  forcings. 

The  plankton  ecosystem  model  that  is  used  here  includes  three  components,  that  repre¬ 
sent  nutrient,  N,  phytoplankton,  P,  and  zooplankton,  Z  (see  [16]  for  details).  The  dynamics 
of  the  ecosystem  is  described  by  the  reaction-advection-diffusion  equations 

~Dt  -  -  P1^TNP+ 

+m  ((1  -  +  upP  +  LizZ2)  +  dv2n 

t£=  PisHnP  -  -  f  pP  +  <17) 

‘m  =  +  »V2Z  . 

where  D/Dt  =  d/dt  +  ud/dx  +  vd/dy. 

The  terms  on  the  right  hand  side  of  the  equation  for  the  nutrient  represent  respectively 
vertical  nutrient  supply  from  deep  water,  nutrient  consumption  by  phytoplankton,  bacterial 
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regeneration  of  the  dead  organic  matter  into  nutrients  in  the  euphotic  layer,  and  diffusion. 
Bacterial  processes  are  here  assumed  to  be  faster  than  the  other  processes;  consistently, 
we  do  not  model  bacterial  dynamics  explicitly  and  assume  that  there  is  a  large  bacterial 
population  which  adjusts  instantaneously  to  the  availability  of  dead  matter  to  decompose. 
All  variables  are  measured  in  unit  of  nitrogen  concentration. 

Phytoplankton  dynamics  is  regulated  by  production,  dependending  on  available  nutri¬ 
ents  through  a  Holling  type-II  (or  Michaelis-Menten)  functional  response,  by  a  Holling  type 
III  grazing  by  zooplankton,  by  linear  mortality  and  by  diffusion  (see  for  example  [8]  for  an 
overview  of  mathematical  models  in  Ecology  and  their  jargon).  Finally,  zooplankton  grows 
when  phytoplankton  is  present  (7  is  the  assimilation  efficiency  of  the  zooplankton) ,  and  has 
a  quadratic  mortality  term  used  to  close  the  system  and  parameterize  the  effects  of  higher 
trophic  levels. 

In  the  nutrient  equation,  the  term  /i?v  is  smaller  than  one  and  represents  the  fact  that 
not  all  biological  substance  is  immediately  available  as  nutrient:  the  fraction  (1  —  /j,v)  is 
lost  by  sinking  to  deeper  waters.  Note,  also,  that  nutrient  enters  this  model  by  affecting 
the  growth  rate  of  phytoplankton.  Since  the  formulation  adopted  here  is  two-dimensional 
in  the  horizontal  and  no  vertical  structure  of  the  fields  is  allowed,  vertical  upwelling  has  to 
be  parameterized. 

Nutrient  is  brought  to  the  surface  from  deep  water  by  (isopycnal  and  diapycnal)  turbu¬ 
lent  mixing  and  upwelling.  When  these  processes  are  sufficiently  intense,  it  is  reasonable  to 
assume  that  the  surface  water  becomes  saturated  in  nutrients  (with  respect  to  the  nutrient 
content  in  deep  water)  and  further  mixing  does  not  change  the  concentration  of  available 
nutrients.  Only  when  nutrient  is  removed  (by  phytoplankton  growth  and/or  horizontal  dis¬ 
persion),  vertical  mixing  becomes  effective  again.  This  situation  can  be  represented  by  a 
relaxation  flux,  where  the  concentration  of  nutrient  at  the  surface  relaxes  to  a  value  that 
depends  on  the  nutrient  content  in  the  deep  reservoir.  The  nutrient  supply  can  then  be 
written  in  restoring  form, 

=  ~s(x,y)  (N  -  N0)  ,  (18) 

where  No  is  the  (constant)  nutrient  content  in  deep  waters  and  s  is  the  (spatially  varying) 
relaxation  rate  of  the  nutrient,  which  is  large  in  regions  of  strong  vertical  mixing  and  small 
in  regions  of  weak  vertical  mixing.  This  form  can  also  be  interpreted  as  the  finite-difference 
approximation  to  a  vertical  turbulent  advective  term  that  acts  between  two  layers  with 
nutrient  concentration  N  and  No,  and  it  is  the  standard  formulation  used  for  chemostat 
models  when  the  reservoir  has  infinite  capacity. 

Advection  is  simulated  by  the  action  of  a  barotropic  2D  turbulent  flow  -  an  approxima¬ 
tion  to  mesoscale  turbulence  in  the  ocean.  We  consider  the  case  of  statistically  stationary 
2D  turbulent  field,  forced  at  the  non-dimensional  wavenumber  k  =  40  (that  is,  at  a  length 
scale  which  is  l/40th  of  the  domain)  and  having  resolution  of  5122  grid  points.  Assuming 
that  the  forcing  scale  corresponds  to  the  typical  size  of  an  eddy,  of  about  25  km,  then  the 
domain  size  becomes  1000  km  in  dimensional  units  and  the  resolution  is  about  2  km.  The 
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turbulent  velocity  field  has  mean  eddy  turnover  time  Te  =  2.8  days. 


The  nutrient  sources  are  represented  by  a  given  number  of  randomly  distributed,  circu¬ 
lar  regions  with  fixed  radius  where  nutrient  input  is  stronger  than  average.  In  our  simplified 
view,  we  simulate  the  strong  nutrient  flux  in  these  patches,  which  we  call  “active  regions”, 
by  a  value  of  the  relaxation  rate,  s  =  sa,  which  is  100  times  larger  than  the  relaxation  rate 
in  the  rest  of  the  domain,  which  we  call  the  “passive  region”  and  where  s  =  sp  with  sp  <C  sa. 

In  all  the  numerical  simulations  discussed  here,  the  turbulent  velocity  field  is  the  same. 
What  changes  from  one  simulation  to  another  is  the  spatial  arrangement  of  the  region  where 
the  flux  is  strong  (i.e.,  of  the  “active”  region).  While  the  total  area  of  the  active  region  with 
strong  upwelling  is  kept  constant  at  the  12%  of  the  domain  area,  the  intense  upwelling  is 
respectively  confined  to  a  circular  patch  at  the  center  of  the  domain  (RF1),  or  to  a  number 
of  small  patches  randomly  distributed  in  the  domain  (RFn,  where  n  indicates  the  number 
of  individual  patches).  The  two  types  of  active  regions  are  shown  in  figure  7. 

To  characterize  the  system  behavior,  we  estimate  the  mean  primary  production,  defined 
as  PP  =  (/ 3NP/{kn  +  N)),  where  the  angular  brackets  indicate  average  over  the  whole 
domain.  Primary  production  can  be  taken  as  an  indicator  of  the  efficiency  of  the  biological 
model  to  convert  inorganic  into  organic  matter. 

The  horizontal  stirring  induced  by  the  turbulent  velocity  field  displaces  parcels  of  water 
that  are  rich  in  nutrient  and  planktonic  life  into  areas  of  limited  upwelling,  where  the  supply 
is  not  sufficient  to  sustain  the  biological  activity  at  the  level  present  in  the  parcel.  Vicev- 
ersa,  parcels  with  poor  nutrient  content  and  limited  planktonic  abundance  can  be  displaced 
into  active  regions  where  the  newly  available  nitrate  can  stimulate  a  plankton  bloom.  This 
effect  depends  on  the  intensity  of  turbulent  stirring.  The  exchange  rate  of  water  parcels 
between  active  and  inactive  regions  depends  on  the  flow  characteristics  and  on  the  amount 
of  parcels  that  are  close  to  the  boundary  areas  between  active  and  inactive  regions,  where 
there  are  strong  gradients  in  the  biogeochemical  properties  of  water.  The  more  fragmented 
are  the  upwelling  regions,  the  larger  are  the  boundary  areas.  Experiments  RF1  and  RFn  are 
designed  to  illustrate  the  effects  of  a  variation  in  the  stirring  rate  on  the  biological  activity 
when  the  size  of  the  boundary  areas  with  strong  biogeochemical  gradients  is  changed  from 
small  (RFl)  to  large  (RFn),  while  the  total  upwelling  area  is  kept  constant. 

The  enhanced  stirring  induced  by  advection  increases  the  mean  flux  from  deep  waters. 
The  enhanced  flux  originates  at  active  locations  when  a  parcel  of  water  that  has  low  nutrient 
content  is  advected  over  them.  To  see  how  this  happens,  consider  two  nearby  parcels:  one  is 
in  a  region  with  small  nutrient  upwelling  and  characterized  by  a  steady-state  nutrient  con¬ 
centration  N* ;  the  other  is  in  an  active  region  and  has  a  steady-state  nutrient  concentration 
N*.  In  this  configuration,  the  total  nitrate  flux  associated  with  these  two  parcels  of  water  is 
(sp(No  —  N* )  +  sa{N()  —  TV*)).  Suppose  now  that,  due  to  advection,  the  two  parcels  switch 
their  position:  the  parcel  with  small  nutrient  content  gets  in  a  strong  upwelling  region  and 
viceversa.  In  this  configuration,  the  vertical  flux  is  ( sp(No  —  N*)  +  sa(No  —  N*)') .  The  net 
variation  of  the  nutrient  flux  is  (sa  —  sp)(N*  —  N* ) ,  which  is  proportional  to  sa  —  sp.  This 
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Figure  8:  On  the  left  is  the  time  evolution  of  the  average  primary  productivity.  Different 
curves  represent  simulations  with  different  number  of  fragmented  sources  (from  1  to  7). 
On  the  right  is  the  dependence  of  the  long-term  primary  productivity  (that  is,  the  value 
achieved  after  the  transient  of  panel  a)  on  the  radius  of  the  active  regions.  The  total  area 
of  the  regions  with  strong  upwelling  is  approximately  the  same  in  the  two  cases. 


term  is  positive  as  larger  relaxation  rates  are  found  in  regions  with  strong  vertical  mixing. 
The  enhanced  nutrient  flux  is  thus  due  to  the  asymmetry  in  the  relaxation  times  between 
the  active  and  inactive  regions.  Note,  also,  that  the  exchange  rate  of  water  parcels  between 
the  two  types  of  region  directly  affects  the  increased  nutrient  flux  to  the  surface,  determin¬ 
ing  larger  values  of  the  nutrient  flux  in  the  case  RFn  than  in  the  case  RF1. 

The  results  reported  here  indicate  that  primary  production  has  a  strong  dependence  on 
the  fragmentation  level  of  the  localized  forcing,  which  is  in  turn  associated  with  the  presence 
and  the  properties  of  mesoscale  structures.  These  results  imply  that  the  functioning  of 
marine  ecosystems  is  sgnihcantly  affected  by  the  flow  structure  and  by  the  distribution  of 
the  upwelling/downwelling  regions,  which  is  determined  by  the  nature  of  mesoscale  and 
submesoscale  turbulence. 

5  Advection  over  topography 

As  discussed  above,  primary  productivity  in  the  ocean  depends  on  the  availability  of  nutri¬ 
ents  in  the  euphotic  layer,  which  in  turn  depends  on  the  presence  of  upwelling  which  brings 
nutrients  from  deeper  waters. 

The  properties  of  upwelling  are  determined  by  the  structure  of  the  vertical  velocities. 
Vertical  velocities  are  greatly  affected  by  the  variations  of  topography  which  take  place  at 
steep  continental  shelves  and  near  seamounts  -  areas  where  large  abundances  of  plankton 
and  fish  are  usually  observed.  In  the  following,  we  discuss  the  impact  of  a  vortex  propagat¬ 
ing  across  a  steep  topographic  slope  on  the  dynamics  of  the  plankton  population  [19]. 

The  biological  model  is  the  same  NPZ  model  used  in  the  previous  section,  with  the  only 
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Figure  9:  A  cyclonic  vortex  on  a  sloping  bottom  topography.  Vorticity  is  on  the  left; 
divergence  on  the  right. 
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Figure  10:  Contours  of  horizontal  divergence  and  N,  P  and  Z  fields  at  t=8  days.  Biological 
fields  are  plotted  as  deviations  from  their  equilibrium  values. 


98 


Point  A 


Point  B 


1 


7 

0.8 

E 

z 

2 

0.6 

E. 

N 

0.4 

CL 

Z 

0.2 

0 


Figure  11:  Time  series  of  N,P  and  Z  and  flow  divergence  for  two  points  at  depth  450  meters. 
Solid  line:  N,  dashed:  P,  dashed  dotted:  Z. 
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difference  that  now  the  vertical  nutrient  supply  is  given  by  the  term  <£at  =  —  (sv  +  sn)(N  — 
Nq),  where  the  rate  sv  is  proportional  to  the  vertical  velocity  of  the  flow  and  sn  represent 
a  weak  positive  nutrient  supply  in  normal  conditions  that  allows  for  sustaining  a  non-zero 
population  at  all  times.  In  the  proximity  of  a  vortex,  sv  3>  sn. 

The  value  of  sv  is  determined  by  a  Quasi-2D  dynamical  flow  model  in  the  presence 
of  topography  [4],  The  flow  is  set  in  motion  by  initially  placing  a  cyclonic  vortex  on  the 
escarpment,  see  figure  9.  The  vortex  is  affected  by  the  topography  and  it  drifts  leftward, 
according  to  a  topographic  /3-effect,  generating  a  set  of  propagating  topographic  Rossby 
waves  associated  with  a  pattern  of  vorticity,  divergence  and  vertical  velocity  with  alternat¬ 
ing  sign.  Such  motion  provides  an  oscillating  source  of  nutrients  to  the  ecological  model. 

Figure  10  shows  a  snapshot  of  the  flow  divergence  and  of  the  N,  P  and  Z  fields  after 
8  days,  when  the  topographic  wave  is  travelling  along  the  coast.  The  time  series  for  N,  P 
and  Z  and  the  flow  divergence  are  shown  in  figure  (11)  for  two  points  located  in  the  sloping 
region  at  a  depth  of  450  m  and  spaced  100  km  from  each  other.  The  divergence  time  series 
clearly  shows  an  oscillatory  pattern  with  a  period  of  about  11  days.  The  amount  of  nutri¬ 
ents  increases  whenever  there  is  positive  divergence  (that  is,  upward  vertical  velocities)  and 
decreases  if  it  is  negative  (that  is,  a  downward  velocity).  As  a  response  to  this  oscillating 
flux  of  nutrients,  oscillatory  behavior  is  observed  also  in  the  phyto-  and  zooplankton  concen¬ 
trations.  This  suggests  that  point  measurements  of  ecological  fields  over  steep  slopes  could 
reflect  the  characteristics  of  topographic  waves  rather  than  internal  ecosystem  dynamics;  a 
proper  interpretation  of  the  measurements  should  thus  take  into  account  the  properties  of 
the  Rossby  waves. 

6  Mesoscale  Turbulence  and  Coexistence 

As  a  final  example,  we  discuss  how  mesoscale  vortices  affect  species  coexistence.  To  study 
this  problem,  we  consider  a  system  with  one  limiting  nutrient,  N,  and  two  phytoplankton 
populations,  P±  and  P2  (see  [1,  15]  for  further  details).  The  system  is  assumed  to  be  in  a 
chemostat,  i.e.  there  is  a  constantly  replenishing  supply  of  N  and  a  constant  loss  of  Pj : 


DN 
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DP i 
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-so(N-No) 


1  PiNPi 

pi  ki  +  N 


1  P2NP2 

P2  ^2  T  N 


+  DV2N, 


PiNPj 

h  +  N 


PiPi  +  DV~Pi  i  —  1,2. 


(19) 

(20) 


The  term  —sq(N  —  Nq)  is  the  nutrient  input,  described  as  a  relaxation  to  the  nutrient 
concentration  in  the  deep  reservoir,  JVo,  with  a  rate  so-  The  terms  represent  nutrient 

uptake  by  phytoplankton,  using  a  Michaelis-Menten  (or  Monod)  functional  form.  The  con¬ 
stants  pi  and  P2  are  used  to  transform  phytoplankton  biomass  into  nutrient  concentration  if 
they  are  described  by  different  units  (for  example,  nitrogen  concentration  for  nutrient  and 
carbon  or  biomass  for  phytoplankton).  The  terms  PiPt  are  natural  plankton  mortalities 
(which  include  possible  sinking  of  the  plankton  cells)  and  the  last  terms  represent  diffusion. 
Again,  the  total  time  derivative  is  D/Dt  =  d/dt  +  ud/dx  +  vd/dy.  No  direct  bacterial 
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regeneration  term  has  been  included. 


In  homogeneous  conditions  at  equilibrium,  there  is  no  co-existence  of  the  two  plankton 
populations,  and  only  the  most  favored  species  survives,  as  determined  by  the  values  of  the 
different  parameters.  In  fact,  the  so-called  Principle  of  Competitive  Exclusion  [3,  5]  states 
that  if  two  species  are  too  similar,  they  cannot  coexist  in  equilibrium:  whenever  two  species 
compete  for  the  same  resource,  the  most  favoured  will  survive  and  the  less  favoured  will 
eventually  go  locally  extinct.  By  extension,  at  equilibrium  the  number  of  species  compet¬ 
ing  on  the  same  resources  cannot  be  larger  than  the  number  of  resources.  Phytoplankton, 
however,  seem  to  escape  this  limitation,  since  a  large  number  of  species  that  compete  for 
the  same  few  resources  is  usually  observed.  This  phenomenon,  known  as  the  Paradox  of  the 
Plankton ,  was  formulated  by  Hutchinson  about  fifty  years  ago  [6].  Competition  is  avoided 
in  many  instances  by  partitioning  space  and/or  time:  the  unfavoured  species  may  be  segre¬ 
gated  in  a  spatial  environment  forbidden  to  the  stronger  species,  or  the  two  species  might 
perform  differently  at  different  times.  Mesoscale  vortices  can  be  one  of  the  causes  of  the 
spatial  segregation  of  unfavoured  and  favoured  competitors,  and  the  sheltering  effect  offered 
by  the  vortices  can  allow  unfavoured  competitors  to  survive  for  prolonged  periods  of  time. 

To  study  this  issue,  we  consider  system  (20)  with  initially  inhonogeneous  plankton  dis¬ 
tributions  and  consider  two  alternatives  for  the  flow  advection: 

1.  A  stochastic  process  representing  a  random  walk  with  memory;  the  various  ‘kicks’  are 
correlated.  This  flow  has  no  coherent  structures. 

2.  2D  turbulence,  with  coherent  vortices. 

For  the  stochastic  process,  we  use  a  Ornstein-Uhlenbeck  (Langevin)  process  [18],  defined 
as: 
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(21) 


where  TjJ  is  the  velocity  autocorrelation  time,  a  is  the  standard  deviation  of  velocities,  and 
Wx ,  Wy  are  white- noise,  Gaussian  Wiener  processes  characterized  by 


(dW)  =  0 

(dW(t)dW{t'))  =  S(t  —  t')dt, 


(22) 

(23) 


such  that  the  noise  intensities  at  different  times  are  not  correlated.  Given  these  assumptions, 
the  autocorrelation  of  the  velocity  components  is 


R(t) 


(u(t)u(t  +  t)) 

(um 


exp  (— t/Tl), 


(24) 
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Figure  12:  Panels  (a)  and  (b)  show  snapshots  from  numerical  simulations  of  two  competing 
species,  in  white  and  black,  described  by  equations  (19,20),  at  the  beginning  (when  they  are 
saparate  in  two  different  regions)  and  at  a  later  time,  when  advection  has  transported  them 
across  the  flow  domain.  The  left  panels  in  both  a)  and  b)  refer  to  advection  by  the  Ornstein- 
Uhlenbeck  process  described  in  equation  (21),  and  the  right  panels  refer  to  advection  by 
2D  turbulence.  Panel  (c)  shows  the  concentration  of  the  less  favoured  population  over 
time  for  the  Ornstein-Uhlenbeck  process  (dotted  line)  and  for  2D  turbulence  (solid  line). 
The  most  favoured  population  ‘wins’  in  both  cases,  but  the  vortices  of  2D  turbulence  act 
as  transport  barriers  and  slow  down  the  contact  of  the  two  populations  and  the  rates  of 
biological  reactions. 


102 


such  that  velocities  are  correlated  on  a  timescale  T^.  The  probability  distribution  of  the 
velocity  components,  p(u)  =  p{y )  since  the  flow  is  statistically  isotropic,  is  given  by 


p(u) 


vk exp  {-h)  ■ 


(25) 


Figure  12  shows  the  comparison  between  the  plankton  concentrations  obtained  with  the 
two  types  of  advecting  flows,  simulated  in  the  same  doubly  periodic  domain  and  with  the 
same  kinetic  energy  and  decorrelation  times,  comparable  with  those  of  mesoscale  turbulence 
in  the  upper  layer  of  the  ocean.  The  most  favoured  population  (here  represented  in  white) 
‘wins’  in  both  cases,  but  it  takes  much  longer  in  the  case  of  2D  turbulence.  In  general, 
in  2D  turbulence  the  vortices  act  to  segregate  the  populations  and  allow  the  less  favoured 
population  to  survive  longer,  providing  a  shelter  area.  In  case  the  two  populations  have 
different  fitnesses  in  different  seasons,  the  presence  of  the  vortices  allows  for  the  survival  of 
the  temporarily  less  fit  population  and  for  a  long-term,  seasonally  oscillating  coexistence 
between  the  two  plankton  populations.  In  conclusion,  these  results  indicate  that  advection 
significantly  affects  biological  reaction  rates,  and  so  it  must  be  taken  into  account  when 
studying  the  dynamics  of  marine  ecosystems. 


Conclusion 

In  this  Lecture  we  saw  that  the  spatial  and  temporal  distribution  of  plankton  at  scales 
between  a  few  km  and  a  few  hundred  km  is  heavily  affected  by  the  properties  of  mesoscale 
(and  subnresoscale,  but  we  did  not  explore  it  here)  turbulence.  If  there  is  a  message  to  be 
extracted  from  these  examples,  it  is  that  the  functioning  of  aquatic  ecosystems  is  largely 
dependent  on  the  fluid  dynamics  and  that  turbulence  cannot  be  ignored. 
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Plankton  cells  in  turmoil 

and  the  dynamics  of  heavy  impurities  with  finite  size 

Lecture  given  on  30  June  2010  by  A.  Provenzale 


1  Introduction 

In  this  lecture  we  shall  focus  on  the  dynamics  of  individual  phytoplankton  cells  and  on 
simple  models  to  describe  their  motion  in  the  fluid  flow.  Phytoplankton  cells  are  small, 
with  dimensions  up  to  a  few  hundred  microns.  Individual  phytoplankton  cells  immersed  in 
water  are  advected  by  the  surrounding  flow,  even  though  some  species  can  resist  passive 
advection  by  various  means,  such  as  varying  their  buoyancy  and/or  swimming.  In  addition, 
most  plankton  cells  are  slightly  heavier  than  water  and  are  thus  subject  to  gravitational 
sinking  (i.e.,  they  are  negatively  buoyant). 

If  we  want  to  describe  the  dynamics  of  a  phytoplankton  cell  advected  by  a  flow,  we  need 
to  write  the  equations  of  motion  for  the  cell.  However,  real  phytoplankton  cells  usually 
have  a  very  complex  shape  (which  can  change  with  time),  are  non-homogeneous  in  their 
interior,  can  vary  their  buoyancy  and  can  emit  substances  that  change  the  local  fluid  vis¬ 
cosity.  In  addition,  they  swim,  grow,  reproduce  (split),  and  eventually  die  or  are  eaten  by 
zooplankton  (small,  usually  multicellular  organisms  which  feed  on  algae).  For  fluffy  bodies 
with  spines,  tails  and  flagella  and  an  active  life  such  as  phytoplankters,  the  equations  of 
motion  are  rather  complicated,  and  one  should  resolve  the  Navier-Stokes  equations  with 
the  correct  boundary  conditions  on  the  mobile  surface  of  the  little  plankter. 

In  this  lecture  we  follow  a  traditional  path  for  the  GFD  school,  and  simplify  -  even 
grossly  oversimplify  -  the  problem,  by  considering  the  dynamics  of  infinitesimally  small, 
spherical  heavy  particles,  with  fixed  shape  and  density,  which  are  passively  advected  by  a 
surrounding  fluid  flow,  and  we  call  them  “impurities.”  Of  course,  our  impurities  are  but  a 
far  image  of  phytoplankton  cells  in  the  ocean. 

2  Plankton  and  turbulence:  a  matter  of  size 

Most  fluid  flows  in  the  ocean  and  in  lakes  are  turbulent,  especially  in  the  upper  mixed  layer 
where  phytoplankton  thrive.  Thus,  on  large  scales,  the  world  in  which  plankton  cells  live  is 
a  turbulent  environment.  However,  individual  cells  are  small,  usually  smaller  than  the  Kol¬ 
mogorov  scale  -  the  length  scale  which  marks  the  transition  from  laminar  flow  at  smaller 
scales  to  turbulent  dynamics  at  larger  scales.  For  this  reason,  individual  phytoplankters 
experience  a  viscous,  laminar  and  non-turbulent  flow  in  their  immediate  surroundings. 
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The  motion  of  sea  or  lake  water  under  “normal”  conditions  (ignoring  compressibility, 
thermodynamics,  phase  transitions  and  other  complications)  can  be  described  by  the  in¬ 
compressible  Navier-Stokes  equations, 

Pjjj  =  -Vp  +  -  pgi  +  F  (1) 

where  u(x,  t)  is  the  fluid  velocity,  x  is  the  spatial  coordinate,  t  is  time,  p  is  the  fluid  density, 
p  is  pressure,  p,  is  the  dynamic  viscosity,  g  is  the  gravitational  acceleration,  z  is  the  unit 
vector  in  the  z  direction,  F  is  a  generic  body  force  (if  present)  and  the  substantial  (or 
Lagrangian)  derivative  can  be  expressed  as  Du/Dt  =  du/dt  +  u  •  Vu.  We  have  fixed  the  z 
coordinate  to  point  vertically  upwards.  If  p  is  assumed  to  be  constant,  the  gravitation  term 
can  be  absorbed  in  the  definition  of  pressure. 

Equation  (1)  is  complemented  by  the  incompressible  version  of  the  continuity  equation, 

V  •  u  =  0  ,  (2) 

and  in  this  way  we  get  a  complete  description  of  the  fluid  flow,  four  equations  in  four  vari¬ 
ables.  Of  course,  to  (try  to)  solve  the  equations  we  also  need  boundary  and  initial  conditions. 

The  Reynolds  number  can  be  defined  as  the  ratio  between  the  strength  of  the  nonlinear 
term,  p\u-  Vu|,  and  that  of  the  viscous  term,  |^V2u|.  If  we  call  U  and  L  the  typical  scale  of 
velocity  and  length,  and  scale  time  by  L/U,  we  get  that  the  Reynolds  number  is  R e=UL/u 
where  v  =  p,/ p  is  the  kinematic  viscosity.  When  the  Reynolds  number  is  smaller  than  one, 
i.e. ,  the  velocity  and/or  the  length  scale  are  sufficiently  small,  the  flow  is  laminar  and  viscos¬ 
ity  dominates.  For  Re-^  0,  we  can  discard  the  inertial  term  in  the  Navier-Stokes  equations 
and  find  viscosity-dominated  special  solutions.  When  ReS>  1,  the  flow  becomes  turbulent 
and  the  fluid  velocity  varies  intermittently  and  erratically  in  time  and  space,  spanning  a 
wide  range  of  values. 

The  simplified  form  of  the  Navier  Stokes  equations  given  by  eq.(l,2)  with  constant 
density  describes  the  cleanest  case  of  turbulence,  i.e.,  homogeneous  and  isotropic  three- 
dimensional  turbulence.  Physically,  this  type  of  flow  is  a  good  model  for  turbulence  for 
scales  up  to  a  few  meters,  a  reasonably-sized  portion  of  the  world  for  a  phytoplankton  cell. 

To  quantitatively  describe  this  type  of  turbulence,  we  assume  that  the  energy  input  in 
the  flow  takes  place  at  a  certain  macro  length  scale,  Iq.  and  dissipation  takes  place  mainly 
at  a  very  small  scale,  say  Id-  In  the  scale  range  between  Iq  and  Id ,  energy  gets  transferred 
to  smaller  and  smaller  length  scales  by  the  breakdown  of  the  larger  eddies  into  smaller  ones. 
Here,  an  “eddy”  is  a  generic,  usually  short-lived,  vortical  structure  in  the  three-dimensional 
turbulent  flow,  and  it  is  characterized  by  a  given  (approximate)  length  scale.  If  Iq  3>  Id 
and  no  forcing  is  present  between  these  two  scales,  then  we  see  the  emergence  of  an  inertial 
range  where  the  dynamics  is  independent  of  dissipation  and  forcing.  Assuming  that  the 
spectral  energy  transfer  rate  is  constant  in  the  inertial  range,  so  to  obtain  a  statistically 
stationary  energy  spectrum,  and  using  simple  scaling  arguments,  one  can  derive  the  form  of 
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the  energy  spectrum  in  the  inertial  range,  E(k),  which  obeys  the  celebrated  Kolmogorov’s 
-5/3  law, 

E(k)  ~  e2/3A;-5/3  (3) 

where  k  is  the  wavenumber  (i.e.,  2tt/\  where  A  is  the  wavelength)  and  e  is  the  energy  dis¬ 
sipation  rate  per  unit  volume. 


Figure  1:  Energy  spectrum  of  a  three-dimensional,  homogeneous  and  isotropic  turbulent 
flow. 

The  length  scale  at  which  diffusion  becomes  important  is  called  the  Kolmogorov  scale, 
lv ,  defined  as 

lu  ~  k-1  ~  (vZ/e)1'4  .  (4) 

At  scales  smaller  than  lu,  the  flow  is  laminar  (but  possibly  unsteady).  In  the  ocean,  lv 
ranges  between  6  x  10-5  m  and  0.01  m.  Since  most  phytoplankton  cells  are  smaller  than  a 
few  tens  of  microns  in  size,  they  do  not  experience  a  turbulent  flow  but  are  rather  exposed 
to  a  viscous  laminar  flow. 


The  dominance  of  viscosity  makes  nutrient  uptake  by  phytoplankton  to  be  severely 
limited  by  molecular  diffusion.  For  a  plankton  cell,  nutrient  delivery  happens  in  two  stages 
-  nutrient  delivery  to  the  cell  membrane  and  transport  across  the  membrane.  When  the 
latter  process  is  faster  than  the  first  one  (as  often  happens),  the  nutrient  concentration  at 
the  cell  surface  drops  significantly  below  its  far-held  value.  For  a  molecular  diffusivity  D 
and  a  plankton  cell  of  radius  a,  the  concentration  of  nutrient  in  the  proximity  of  the  cell 
wall  is  governed  by  the  equation 


=  /;v2.V 

dt 


(5) 


whose  spherically  symmetric,  steady  state  solution  is 

N(r)  =  -(N0-Noo)  +  Noo  (6) 

r 

where  Nq  is  the  concentration  at  the  cell  surface  and  N, ^  is  the  far-held  concentration.  The 
diffusional  hux  to  a  spherical  cell  is  then  computed  as 


Qd  =  47rDa(lV00  -  No)  . 


(7) 
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When  the  uptake  capacity  of  the  cell  exceeds  the  rate  of  nutrient  supply  by  diffusion,  a 
nutrient-depleted  layer  is  formed  near  the  cell.  The  time  taken  by  diffusion  for  transporting 
the  nutrient  across  this  layer  can  be  estimated  as 


Td  = 


D 


(8) 


In  water  at  standard  upper  ocean  temperatures,  for  innnotile  cells  with  a  <  lO^um  the  nu¬ 
trient  limitation  can  be  severe. 


Given  that  diffusion  is  not  sufficient  to  bring  enough  nutrients  to  the  cell  surface,  es¬ 
pecially  for  smaller  organisms,  phytoplankters  have  found  ways  to  overcome  diffusion  lim¬ 
itation.  Most  of  these  strategies  are  based  on  the  ability  of  phytoplankters  to  move  along 
trajectories  which  are  not  fluid  particle  paths,  thus  allowing  them  to  explore  new  regions  of 
the  fluid  which  are  not  nutrient  depleted  yet.  The  differences  between  cell  trajectories  and 
fluid  parcel  trajectories  can  be  generated,  for  example,  by 

1.  Swimming  (as  many  flagellates  do); 

2.  Sinking  (as  diatoms  do); 

3.  Exploiting  the  fact  that  plankton  cells  have  different  density  from  the  fluid  and  have 
finite  size. 

In  the  rest  of  this  Lecture,  we  shall  focus  on  the  last  mechanism  and  study  how  heavy 
impurities  with  finite  size  (our  gross  model  for  plankters)  are  advected  by  a  fluid  flow. 


3  Lagrangian  advection 

The  motion  of  an  advected  particle  with  the  same  density  of  the  fluid  and  vanishingly  small 
dimensions  (i.e.,  a  fluid  parcel)  is  described  by  the  standard  Lagrangian  equations 

dX 

~dt  ~  U(X’  *)  <9> 

where  X  is  the  particle  position  at  time  t  and  u(X,  t )  is  the  fluid  velocity  at  the  particle 
position.  Notice  that  equation  (9)  is  non-Newtonian,  in  the  sense  that  it  does  not  obey 
the  standard  Newton’s  second  law  of  motion.  The  reason  is  that  the  particle  has  no  inertia 
with  respect  to  the  fluid,  and  thus  we  equate  the  Lagrangian  particle  velocity  dX/dt  to  the 
Eulerian  velocity  u(X,f). 

Impurities  with  finite  size  and  finite  inertia  do  not  behave  like  fluid  particles.  An  impu¬ 
rity  with  density  pp  moving  with  velocity  V  =  dX /dt  in  a  flow  field  u  of  a  fluid  with  density 
Pf  experiences  forces  due  to  gravity,  F9,  Stokes  drag,  Fs,  and  the  effect  of  the  surrounding 
fluid  (mainly  pressure),  F f,  expressed  as 

1  9  upf  Du 

Fg  =  -g{pP~  pf)z,  Fs  =  -j— -6irpfva{V -u)  =  — ^"(V-u),  Ff  =  pf— 

(10) 
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where,  as  before,  a  is  the  radius  of  the  impurity. 


Thus,  using  Newton’s  second  law  equating  force  to  mass  times  acceleration,  we  can  write 
for  the  motion  of  the  impurity 

dV 

=  Fj  +  Fs  +  Fg  .  (11) 

Rewriting  equation  (11)  we  have 

^  =  <12> 

where  <5  =  Pf/pP  and  ra  =  2a2/ (95V).  The  time  scale  ra  measures  the  relaxation  time  of 
the  particle  velocity  to  the  local  fluid  velocity  and  5  determines  the  nature  of  the  buoyant 
particle  (6  <  1  and  5  >  1  indicate  particles  which  are  heavier  or  lighter  than  the  fluid 
respectively). 


In  still  fluid,  where  u 
impurity  as 


0,  one  can  define  the  (downward)  terminal  velocity  of  the  heavy 


2  Pp _ Pf_ 

PfV 


(13) 


For  example,  an  impurity  with  5  =  0.97  and  a  =  30/rm,  as  appropriate  for  a  plankton  cell, 
the  terminal  velocity  W  is  a  few  meters  per  day. 


It  is  important  to  stress  the  difference  between  the  two  time  derivatives  present  in  equa¬ 
tion  (14).  While  D/Dt  =  d/dt  +  u  •  V  measures  the  rate  of  change  along  the  path  that  a 
fluid  particle  would  follow  locally,  d/dt  =  d/dt  +  V  •  V  denotes  the  time  derivative  while 
tagging  along  the  heavy  impurity. 


/ 

/ 
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Figure  2:  A  graphical  representation  of  the  meanings  of  d/dt  and  D/Dt 


Before  proceeding  further,  we  make  eq.(12)  non  dimensional  and  write 

d' V  Du  1  ~  x\  1  ~ 

dt  Dt  SV  J  v  JFr2 


(14) 
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where  the  tilde  is  used  to  denote  dimensionless  variables.  In  the  non-dimensional  equation, 
St  =  TaU/L  and  Fr  =  U/y/gL  are  the  Stokes  and  Froude  numbers  respectively  and  L 
and  U  are  typical  length  and  velocity  scales.  The  Stokes  number  St  represents  the  ratio 
between  the  particle  relaxation  time  scale,  r0,  and  the  dynamical  time  scale  L/U,  whereas 
Fr  quantifies  the  ratio  of  the  buoyancy  time  scale,  l/^/g/L,  to  the  dynamical  time  scale.  In 
the  following  we  will  drop  the  tilde  and  denote  the  non-dimensional  variables  by  the  same 
notation  as  the  dimensional  ones,  unless  a  distinction  needs  to  be  made. 


4  Impurity  advection  in  2D  stationary  flows 

In  this  section  we  study  the  dynamics  of  an  ensemble  of  passive  impurities  advected  by  a 
stationary,  divergence-free,  two-dimensional  (2D)  cellular  velocity  field  without  the  effects 
of  gravity  (we  can  think  of  a  flow  field  on  a  horizontal  plane).  This  example  allows  for 
stressing  the  differences  between  the  dynamics  of  fluid  parcels  and  that  of  heavy  and  light 
impurities;  note  that  in  this  simple  case  one  can  rescale  the  Stokes  number  by  rescaling  time. 

We  first  consider  the  case  of  an  isotropic  streamfunction  =  2(cosx  +  cos  y).  The 
fluid  velocity  is  u  =  (u,v)  and  the  velocity  components  are  given  by  u  =  —dip/dy  and 
v  =  difr/dx.  In  a  2D  flow,  fluid  parcels  obey  the  equations 

dX  _  di[j  dY  _  ch/>  ,  . 

~dt  =  ~dY  ’  ~dt  =  OX  (  ’ 

which  is  formally  an  Hamiltonian  system  with  phase  space  (x,  y )  where  x  and  y  are  canoni¬ 
cally  conjugate  variables  and  the  stream  function  plays  the  role  of  the  Hamiltonian.  In  this 
case,  one  has  conservation  of  phase-space  area  and  no  preferential  fluid  particle  concentra¬ 
tion  is  possible.  For  a  stationary  flow,  fluid  parcels  follow  the  streamlines;  since  the  system 
is  two-dimensional,  there  is  no  chaotic  dynamics  and  the  fluid  parcels  undergo  only  regular 
trajectories. 

The  situation  is  different  for  passive  impurities,  for  which  the  phase  space  (x,y,Vx,Vy) 
is  four-dimensional  and  the  dynamics  is  dissipative.  In  the  case  of  particles  lighter  than  the 
fluid,  6  >  1,  linear  stability  calculations  show  that  the  fixed  points  in  (xn,  ym)  =  (2n7r,  2rmr) 
and  {x'n.  y'm)  =  ((2ra  +  l)7r,  (2ra  +  l)7r)  ( n,m  being  integers)  are  stable  and  attracting,  while 
the  fixed  points  (. xn,ym )  =  (2n7r,  (2m  +  l)7r)  and  (. x'n,y'm )  =  ((2 n  +  l)7r,2m7r)  are  sad¬ 
dles.  For  heavy  particles  (6  <  1),  all  these  fixed  points  are  unstable.  In  this  case,  the 
impurities  undergo  chaotic  trajectories,  hopping  from  one  flow  cell  to  another,  and  display 
non-turbulent  diffusion,  even  in  the  case  of  a  stationary  velocity  field  [3]. 

Figure  4  shows  how  an  initially  homogeneous  distribution  of  heavy  and  light  impurities 
evolves  in  the  prescribed  flow.  The  short-time  dynamics  is  very  similar  for  both  light  and 
heavy  particles.  An  interesting  feature  is  the  transformation  of  an  initially  uniform  particle 
distribution  into  a  distribution  characterized  by  sharp-edged  caustics;  note  that  caustics 
cannot  form  for  fluid  particles,  owing  to  the  conservation  of  phase-plane  area  associated 
with  the  Hamiltonian  nature  of  the  system.  This  constraint  does  not  act  for  impurites, 
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Figure  3:  The  stationary  two-dimensional  streamfunction  i/j  =  2(cosx  +  cos  y)  and  the 
associated  velocity  field. 

owing  to  the  dissipative  nature  of  the  equations  describing  impurity  motion.  For  light 
particles,  the  caustics  spiral  around  the  stable  fixed  points.  Hence,  for  a  stationary  two 
dimensional  flows,  light  particles  tend  to  concentrate  at  the  stationary  points  of  the  flow; 
for  large  enough  times  the  particles  are  practically  absent  in  the  remaining  regions  of  the 
flow.  On  the  contrary,  heavy  particles  exhibit  a  very  different  behavior:  After  the  initial 
formation  of  caustics,  the  heavy  particles  undergo  diffusion  along  the  separatrices  of  the  ad¬ 
verting  flow,  owing  to  the  chaotic  nature  of  the  heavy  particle  dynamics  for  the  parameter 
values  adopted  here. 

Figure  5(a)  shows  the  trajectory  of  an  individual  heavy  impurity  in  the  cellular  flow 
field.  The  particle  alternates  between  spiralling  inward  around  one  of  the  (now  unstable) 
fixed  points  and  escaping  along  one  of  the  unstable  directions,  with  a  quasiballistic  mo¬ 
tion.  Although  this  alternance  of  trapping  and  quasiballistic  motion  could  be  reminiscent 
of  a  Levy  flight,  for  the  system  under  study  the  long-time  probability  distribution  of  the 
particle  velocity  is  Gaussian,  and  it  is  consistent  with  a  standard  Brownian  diffusion  process. 

We  can  compute  the  diffusion  coefficient  for  an  ensemble  of  N  advected  impurities  as  a 
function  of  e  =  1  —  5, 


Dx  =  9^v  X  t Xi ®  ~  Xi(°)]2  (16) 

2—1 

1  N 

Dy  =  &  9 Jn  X  biW  “  y*(°)]2  •  (17) 

2—1 

Figure  5(b)  shows  the  dependence  of  Dx,Dy  on  e.  Since  the  flow  is  isotropic,  the  two  dif¬ 
fusion  coefficients  are  equal.  In  general,  no  anomalous  (non-Brownian)  diffusion  has  been 
observed  for  this  system. 


Ill 


In  the  case  of  a  weakly  anisotropic  velocity  field,  one  observes  some  new  effects.  In 
the  following,  we  consider  the  flow  described  by  the  streamfunction  ip  =  A  cos  x  +  B  cos  y, 
where  A  ^  B.  If  one  looks  at  the  streamfunction  (figure  6),  it  is  evident  that  there  are 
open  “channels”  (with  our  choice  of  parameters,  in  the  horizontal  direction)  where  the  fluid 
particles  undergo  ballistic  motion.  These  are  analogous  to  the  libration  orbits  in  the  phase 
space  of  a  nonlinear  pendulum  (the  closed  orbits  would  correspond  to  periodic  oscillations). 
Thus,  fluid  particles  are  either  trapped  inside  the  cellular  regions  or  undergo  ondulating 
ballistic  motion  in  the  channels. 

A  different  story  is  observed  for  advected  impurities.  As  for  the  isotropic  flow,  light 
particles  sooner  or  later  get  trapped  at  the  stagnation  points.  On  the  other  hand,  heavy 
impurities  spend  a  long  time  in  quasiballistic,  motion  in  the  channels,  and  temporary  trap¬ 
ping  in  the  cellular  regions  is  followed  by  jumping  off  to  another  channel.  Figure  7  shows 
that  an  intially  uniform  particle  distribution  tends  to  become  spread  along  the  channels, 
with  strong  anisotropy. 

In  this  flow,  the  diffusion  coefficients  along  the  x  and  y  directions  are  very  different.  Fig¬ 
ure  8(b)  reveals  that  Dx,  the  diffusion  coefficient  along  the  channels,  diverges  with  e  — >  0, 
Dx  ~ e-06  while  Dy  ~  eL5.  Thus  a  small  anisotropy  in  the  Eulerian  flow  (A  =  2,  B  =  2.2) 
leads  to  diffusion  coefficients  that  differ  by  several  orders  in  magnitude  in  the  case  of  par¬ 
ticles  which  are  only  slightly  heavier  than  the  fluid  (6  =  0.9).  This  could  have  interesting 
consequences  for  the  motion  of  plankton  cells  in  weakly  anisotropic  flows,  as  the  small  den¬ 
sity  difference  between  the  plankton  cells  and  the  water  could  result  in  a  strongly  anisotropic 
diffusion  of  the  particles. 


5  Neutrally  buoyant  impurities  with  finite  size 

Often,  advected  impurities  have  a  density  that  is  rather  close  to  that  of  the  fluid  (as  hap¬ 
pens,  for  example,  with  most  plankton  cells),  but  have  a  finite  size,  i.e.,  they  are  not  fluid 
elements.  To  explore  the  dynamics  of  these  particles,  we  consider  the  case  of  neutrally 
buoyant  particles  with  5  =  pf/pp  =  1  and  finite,  albeit  small,  radius  a.  The  equation  for 
particle  motion  in  this  case  becomes: 


where  now 


(TV 

dt 


Du 

~Dt 


St 


2 

9 


(18) 


If  we  make  the  approximation  that  the  time  derivative  along  the  trajectory  of  an  impurity 
and  that  along  the  trajectory  of  a  fluid  parcel  are  the  same,  i.e.  d/dt  =  D /Dt,  the  problem 
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becomes  very  simple.  In  this  case,  equation  (18)  can  be  written  as 

|,V-u)  =  -i(V-u,. 

Hence 

V  =  u  +  (V  —  u)0  exp^—  ^tj  , 

where  (V  —  u)0  denotes  the  initial  value  of  (V  —  u).  From  this  we  could  infer  that  even 
if  we  release  a  particle  with  an  initial  velocity  which  differs  from  that  of  the  fluid,  after 
a  transient  the  particle  velocity  will  tend  to  the  fluid  velocity,  implying  that  a  neutrally 
buoyant  particle  will  behave,  after  an  initial  transient,  as  an  ideal  tracer. 


In  reality,  the  above  conclusion  is  not  correct,  as  the  problem  is  complicated  by  the  fact 
that  d/dt  and  D/Dt  are  not  the  same.  Equation  (18)  should  then  be  written  as 

^  (V  -  u)  =  -  (V  -  u)  ■  Vu  -  -i  (V  -  u) .  (19) 

Defining  M  =  V  —  u,  equation  (19)  can  be  rewritten  as 


dM 

dt 


-M-  Vu-  —  M 

St 


-|J  +  ^ 


•M, 


(20) 


where  I  is  the  identity  matrix  and  J  is  the  velocity  Jacobian, 


f dxu  dyu\ 
\dxv  dyv]  ’ 


u(x) 


If  we  diagonalise  the  matrix  we  obtain 


dMd 

dt 


■Md. 


From  this  expression,  one  can  obtain  relevant  information  on  the  behavior  of  the  advected 
impurities.  For  a  two-dimensional  stationary  flow,  the  difference  between  the  fluid  velocity 
and  the  particle  velocity  evolves  in  time  depending  on  the  value  of  ±A  —  1/St.  In  particular, 
the  particle  velocity  can  diverge  from  the  fluid  velocity  when 


from  which  one  sees  that  the  stability  of  the  particle  trajectories  (with  respect  to  the  fluid 
parcel  trajectories)  is  determined  by  the  balance  between  the  eigenvalues  of  the  fluid  ve¬ 
locity  field  and  the  Stokes  number.  When  the  local  value  of  A2  (a  property  of  the  Eulerian 
advecting  flow)  is  sufficiently  large,  the  neutrally  buoyant  impurity  can  follow  a  trajectory 
that  is  different  from  that  of  a  fluid  parcel  placed  at  that  point. 


113 


We  can  get  a  further  advancement  by  resorting  to  the  Okubo- Weiss  parameter  [7,  11] 

Q  =  -det  J  =  j  (S2  -  uj2)  ,  (21) 

where  S 2  =  ( dxu  —  dyv )2  +  (dyu  +  dxv )2  is  the  total  strain  (deformation)  and  u>  =  dv/dx  — 
du/dy  is  the  vorticity.  This  parameter  characterises  the  local  shear/rotational  properties 
of  the  flow  and  separates  the  fluid  domain  into  elliptic  and  hyperbolic  regions  [7,  11,  4]: 


Q 


<  0  rotation  —  dominated 
>  0  shear  —  dominated. 


Since  A  satisfies  det(J  —  AI)  =  0,  one  has  that 

A2  —  7YJ  +  det  J  =  0  .  (22) 

In  addition,  as  we  are  now  considering  an  incompressible  flow, 

V  •  u  =  dxu  +  dyv  =  Tr J  =  0.  (23) 

From  (22)  and  (23),  the  Okubo- Weiss  parameter  is  then  Q  =  A2.  If  Q  >  0,  A  is  real  and 
deformation  dominates,  as  around  hyperbolic  points,  whereas  if  Q  <  0,  A  is  imaginary  and 
rotation  dominates,  as  in  the  proximity  of  elliptic  points.  Thus,  we  see  that  hyperbolic 
regions  can  lead  to  a  divergence  between  the  fluid  particle  paths  and  the  trajectories  of  the 
impurities,  if  they  are  “hyperbolic  enough”  (i.e.,  the  value  of  A2  is  large  enough).  As  a 
result,  neutrally  buoyant  impurities  will  be  ejected  from  hyperbolic  regions  and  will  wander 
around  till  they  end  up  in  elliptic  regions,  were  they  will  follow  the  same  trajectory  of 
a  fluid  parcel.  This  potentially  leads  to  a  preferential  concentration  of  neutrally  buoyant 
impurities  in  elliptic  regions,  with  a  biassed  sampling  of  the  advecting  flow. 

5.1  Example  1:  Time  independent  flow 

To  illustrate  the  effects  of  St  and  Q  on  the  dynamics  of  a  neutrally  buoyant  particle,  we 
consider  the  simple  incompressible  two-dimensional  model  flow  defined  by  the  stream  func¬ 
tion  i/j(x,y)  =  A  cos  x  cos  y  [1]. 

Figure  9(a)  shows  the  contour  lines  of  the  stream  function  and  of  the  Okubo- Weiss 
parameter  Q.  Figure  9(b)  shows  the  trajectory  of  a  neutrally  buoyant  particle  whose  initial 
velocity  is  different  from  that  of  the  fluid  at  the  impurity  position.  The  difference  between 
the  velocity  of  the  impurity  and  the  fluid  velocity  is  amplified  in  highly  hyperbolic  regions, 
where  impurity  trajectories  and  fluid  parcel  trajectories  separate  exponentially.  Fig.9(c) 
shows  that  the  neutrally  buoyant  impurity  has  a  long  and  complicated  trajectory,  wandering 
between  different  cells  before  it  comes  across  the  vicinity  of  an  elliptic  point  and  settles  in 
one  of  the  low-Q  regions.  The  time  evolution  of  the  difference  between  the  impurity  velocity 
and  the  fluid  velocity  at  the  impurity  position  is  shown  in  Fig. 9(d).  Once  the  impurity  has 
settled  in  an  ellpitic  region,  the  velocity  difference  between  the  fluid  and  the  impurity  tends 
to  zero  an  no  further  instability  is  observed.  However,  in  their  wanderings  impurities  leave 
hyperbolic  regions,  leading  to  a  preferential  concentration  of  neutrally  buoyant  impurities 
in  the  elliptic  regions  of  the  flow. 
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5.2  Example  2:  Time  periodic  flow 

The  time-dependent  flow  defined  by  the  stream  function  tp(x,  y)  =  A  cos(x  +  B  sin  cot)  cos  y 
shows  interesting  features  [1] .  As  in  a  typical  non-stationary,  two-dimensional  non-integrable 
Hamiltonian  system,  there  are  regions  of  phase  space  where  fluid  parcels  undergo  chaotic 
motion.  Figure  10(a)  shows  the  Poincare  section  of  the  chaotic  trajectory  of  a  fluid  particle, 
here  obtained  by  plotting  the  parcel  position  at  times  t  =  0,  T,  2 T, ...  where  T  =  2it/lu.  The 
trajectory  visits  a  large  part  of  the  domain  but  it  is  bounded  by  two  separatrices  parallel  to 
the  x  direction,  which  forbid  fluid  particle  motion  across  them  (remember  that  the  temporal 
variability  is  only  in  the  velocity  component  along  x). 

The  trajectory  of  a  neutrally  buoyant  impurity,  shown  in  Fig.  10(b),  indicates  that  the 
impurity  can  wander  on  a  larger  portion  of  the  domain,  but  in  the  end  it  converges  to  a 
non-chaotic  trajectory  around  an  elliptic  point.  From  that  moment,  the  impurity  behaves  as 
a  fluid  particle.  However,  the  impurity  has  selected  a  specific  sub-domain  in  space  without 
ever  coming  back  close  to  its  initial  conditions,  at  variance  with  what  would  have  happened 
for  a  Hamiltonian  system.  The  equations  of  motion  for  a  neutrally  buoyant  impurity  are 
dissipative,  and  there  is  no  conservation  of  phase-space  volume.  For  completeness,  Fig.  10(c) 
shows  the  difference  between  the  velocity  of  the  impurity  and  that  of  the  fluid,  as  a  function 
of  time. 

5.3  An  application  to  two-dimensional  turbulence 

The  results  discussed  above  have  interesting  implications  for  two-dimensional  turbulent 
flows  [1,  9].  We  integrate  the  two-dimensional  vorticity  equation  for  an  incompressible  fluid 
with  bi-harmonic  (hyper)viscosity, 

did 

—  +  uj)  =  -vhVAui,  (24) 

where  i/j  is  the  stream  function,  oj  =  V2?/’  is  the  vorticity,  u  =  —dyip  and  v  =  dxip  are  the 
velocity  components  and  the  hyperviscosity  parameter  is  t'/,  =  2.5  x  10~7.  To  integrate  the 
equation,  we  use  a  pseudo-spectral  method  with  5122  collocation  points  in  a  doubly  periodic 
domain;  here  we  consider  a  freely-decaying  case  with  no  external  forcing.  In  this  flow,  an 
initially  homogeneous  distribution  of  neutrally  buoyant  impurities  evolves  in  time  towards 
a  distribution  where  the  impurities  are  concentrated  in  the  inner  part  of  vortices,  where 
Q  <  0,  as  shown  in  Fig.  11.  This  may  have  interesting  consequences  when  we  design  fluid 
experiments  using  neutrally  buoyant  tracer  particles,  as  one  should  verify  that  the  tracers 
really  provide  an  unbiassed  sampling  of  the  velocity  field. 

6  The  role  of  rotation 

The  presence  of  a  rotation  introduces  new  effects  in  the  dynamics  of  the  heavy  impurities 
[9].  When  the  (apparent)  Coriolis  and  centrifugal  forces  are  acting  on  an  impurity,  the 
dimensional  equation  for  the  impurity  motion  becomes 

— —  =  5— - (V  —  u)  —  (1  —  6)  gz  —  2f2  x  (V  —  <5u)  +  |Stj2  R  (1  —  S) ,  (25) 

at  Dt  Ta 
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where  6  and  ra  are  defined  as  before  and  R  is  the  distance  of  the  particle  from  the  axis 
of  rotation.  Notice  that  the  term  —2 SCI  x  u  had  to  be  subtracted  from  the  Coriolis  term 
acting  on  the  impurity  because  it  was  already  taken  into  account  in  the  expression  dDu/Dt. 
Thus,  the  Coriolis  term  acts  on  the  density-weighted  difference  between  the  velocity  of  the 
impurity  and  that  of  the  fluid.  Non-dimensionalising  the  above  equation,  we  obtain 


dV 

dt 


(26) 


where  St  is  defined  as  before  and 


Ro  = 


U 

2RL 


is  the  Rossby  number.  Fig.  12  shows  a  distribution  of  8000  heavy  impurities  which  were  ini¬ 
tially  homogeneously  distributed  in  a  statistically  stationary  numerical  simulation  of  forced 
and  dissipated  2D  turbulence  [9].  The  numerical  integration  has  been  performed  in  the 
doubly  periodic  domain  [0,  27r]  x  [0,  27t]  with  128  x  128  collocation  points.  The  particles 
have  5  =  0.8,  St  =  0.01  and  Ro  =  1/320.  In  the  integration,  we  have  discarded  the  effect  of 
the  centrifugal  force.  The  heavy  impurities  concentrate  inside  the  centers  of  the  anticyclonic 
vortices,  leaving  both  the  turbulent  background  and  the  cores  of  cyclonic  vortices.  This  is 
due  to  the  fact  that  the  Coliolis  force  adds  up  to  the  inertial  force  to  eject  heavy  impurities 
from  cyclones,  whereas  it  works  against  the  inertia  to  push  impurities  towards  the  center  of 
anticyclones.  When  Ro  is  small  enough,  the  Coriolis  force  overcomes  the  centrifugal  effect 
of  inertia  and  the  impurities  concentrate  in  the  cores  of  anticyclonic  vortices.  When  the 
vortices  are  especially  intense,  they  generate  a  strong  centrifugal  force  due  to  the  particle  in¬ 
ertia.  The  strength  of  the  centrifugal  push  by  the  vortex  becomes  weaker  when  the  distance 
from  the  vortex  center  increases.  In  this  case,  the  Coriolis  force  balances  inertia  at  some 
distance  from  the  vortex  center  and  the  impurities  concentrate  in  an  annular  distribution 
inside  the  vortex. 


As  a  final  example,  we  momentarily  leave  the  world  of  plankton  and  discuss  the  (again 
idealized!)  motion  of  dust  grains  in  the  solar  nebula  [2].  In  this  case,  <5  ~  10-8  so  that  we 
may  may  discard  the  term  5  D\ /Dt.  The  impurity  equation  is  then  reduced  to 

^  (V  -u)  -  V<F  -  2fi  x  V  +  |S1|2R,  (27) 

dt  Ta 

where  $  is  a  gravitational  potential.  In  the  specific  case  of  particles  spinning  around  the 
protosun  in  the  solar  nebula  (a  rarefied  fluid  composed  mainly  by  hydrogen)  the  equation 
of  motion  of  a  dust  particle  becomes 

—  (V-u)  -2U  x  V-  ^R+  |II|2R.  (28) 

dt  te  R 


Here,  G  and  M  are  the  gravitational  constant  and  the  mass  of  the  protosun,  respectively. 
For  the  rarefied  conditions  of  the  protoplanetary  nebula,  the  friction  time  scale  te  is  set  to 
have  the  form  te  oc  a/pf,  where  a  is  the  radius  of  the  particle.  This  expression  of  the  drag 
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coefficient  corresponds  to  what  is  called  the  Epstein  regime  for  a  rarefied  gas. 

To  proceed  with  the  study,  we  consider  a  simplified  representation  of  the  protosolar  neb¬ 
ula  by  using  a  two-dimensional  turbulent  flow  subject  to  a  circular ly-symmetric  Keplerian 
shear,  physically  generated  by  the  presence  of  a  central  massive  object  (the  protosun)  [2]. 
Figure  13(a)  shows  the  vorticity  field  at  sufficiently  large  time,  obtained  by  numerical  sim¬ 
ulation  of  this  system.  By  perturbing  the  initial  Keplerian  shear  (which  is  linearly  stable  in 
the  absence  of  magnetic  effects),  both  cyclonic  and  anticyclonic  vortical  perturbations  tend 
to  form.  However,  cyclonic  vortices  are  rapidly  destroyed  by  the  adverse  shear  -  or  do  not 
form  at  all  -  and  only  anticyclonic  vortices  survive  in  the  Keplerian  shear,  as  illustrated  in 
Figurel3(a).  As  already  seen  in  the  case  of  standard  two-dimensional  turbulence  discussed 
above,  the  heavy  impurities  tend  to  concentrate  inside  the  anticyclonic  vortices  as  time 
progresses.  Fig.  13(b)  shows  the  distribution  of  an  ensemble  of  heavy  impurities,  initially 
released  with  a  uniform  distribution,  in  the  flow  field  of  Fig.  13(a).  From  these  results,  one 
could  speculate  that  dust  grains  in  the  solar  nebula  could  be  concentrated  by  the  dynamics 
in  the  cores  of  anticyclonic  vortices,  where  the  subsequent  evolution  (collisions,  gravitational 
attraction)  could  more  easily  ensue. 


7  Conclusion 

The  dynamics  of  passively  advected  impurities  is  quite  different  from  that  of  fluid  parcels. 
From  a  mathematical  point  of  view,  the  phase  space  of  the  impurity  motion  has  twice  the 
number  of  dimensions  of  that  of  the  fluid  parcels,  since  it  is  described  by  particle  posi¬ 
tions  and  velocities  instead  of  just  particle  positions,  as  in  the  case  of  fluid  elements.  In 
addition,  the  presence  of  Stokes  drag  makes  the  system  dissipative,  with  the  possibility  of 
non-homogeneous  particle  distributions  in  space  and  preferential  particle  concentrations  in 
specific  regions  of  the  flow.  Physically,  this  means  that  advected  impurities  may  provide  a 
non-homogeneous  sampling  of  the  fluid  domain.  Even  neutrally  buoyant  impurities,  when 
their  size  is  large  enough,  can  behave  very  differently  from  fluid  parcels.  All  these  results 
indicate  that  small  objects  immersed  in  a  fluid  -  such  as  plankton  cells  in  water  or  dust 
grains  in  the  solar  nebula  -  can  undergo  complicated  dynamics  which  cannot  be  understood 
only  from  considering  the  motion  of  fluid  particles. 

Before  closing  this  lecture,  it  is  important  to  recall  that  so  far  we  have  simplified  the 
description,  neglecting  for  example  the  effect  of  unsteady  forces  associated  with  the  acceler¬ 
ation  of  the  impurities,  such  as  the  added  mass  term  [6]  and  the  Basset  force  [10].  These  two 
terms  become  important  especially  when  5  =  Pf/pP  ~  1.  Full  consideration  of  the  Basset 
term  is  especially  important,  but  difficult  from  a  numerical  point  of  view  owing  to  the  need 
of  storing  the  past  trajectories  of  a  large  number  of  particles.  Further  complications  arise 
from  considering  non-spherical  particles,  a  fact  which  induces  new  effects,  see  for  example 
[8]  for  experimental  results  on  the  dynamics  of  sinking  particles  with  plankton-like  shapes 
and  [5]  for  a  theoretical  study  of  the  dynamics  of  elongated  impurities. 
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(a)  <5  =  1.1 


(b)  5  =  0.9 


Figure  4:  Distribution  of  10,000  impurities  with  density  ratio  5  =  1.1  (left  panels)  and 
<5  =  0.9  (right  panels)  and  St  =  1  in  the  advecting  flow  defined  by  the  stream  function  ^  = 
2(cos  x+cos  y ).  The  individual  panels  refer  to  different  times,  namely  t  =  0, 0.5, 1,  2,  5, 10, 40 
and  100  non-dimensional  time  units. 


Figure  5:  (a)  Trajectory  of  an  individual  advected  impurity  with  density  ratio  6  =  0.9  and 
St  =  1  in  the  flow  defined  by  ^  =  2(cos  x  +  cos  y).  from  t  =  0  to  t  =  10000  non-dimensional 
time  units,  (b)  Diffusion  coefficients  along  x  and  y,  Dx  (stars)  and  Dy  (crosses),  for  an 
ensemble  of  heavy  impurities  as  a  function  of  e  =  1  —  5. 


Figure  6:  Streamfunction  and  the  corresponding  velocity  field  for  the  anisotropic  cellular 
flow  described  by  "0  =  (^4cosx  +  B  cos  y)  with  A  =  2  and  B  =  2.4. 
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Figure  7:  Distribution  of  10,000  impurities  of  density  ratio  5  =  0.9  and  St  =  1  in  the 
Eulerian  flow  defined  by  the  anisotropic  stream  function  ip  =  (Tlcosx  +  Bcosy).  A  =  2, 
B  =  2.4. 


(a)  (b) 

Figure  8:  (a)  Trajectory  of  an  individual  advected  impurity  with  density  ratio  <5  =  0.9  and 
St  =  1  in  the  Eulerian  flow  defined  by  the  anisotropic  stream  function  ip  =  (A  cos  x+B  cos  y ) 
with  ^4  =  2  and  B  =  2.4,  from  t  =  0  to  t  =  10000  natural  time  units,  (b)  Diffusion 
coefficients,  Dx  (stars)  and  Dy  (crosses),  for  heavy  impurities  as  a  function  of  e  =  1  —  5,  for 
A  =  2  and  B  =  2.2. 


Figure  9:  (a)  Fluid  parcel  trajectories  (thick  lines)  and  magnitude  of  the  Okubo- Weiss  pa¬ 
rameter  Q  (background  shading,  lighter  shading  correspond  to  positive  values  of  Q),  for 
the  stream  function  ^  =  A  cos  x  cosy,  (b)  Trajectory  of  a  neutrally  buoyant  impurity  with 
Stokes  number  St  =  0.2  (solid  line)  and  path  of  a  fluid  particle  starting  at  the  same  ini¬ 
tial  position  of  the  impurity  (dashed  line).  The  instability  in  regions  of  high  Q  allows  the 
impurity  to  wander  between  cells,  diverging  from  the  streamlines,  (c)  After  a  complicated 
excursion,  the  neutrally  buoyant  impurity  eventually  settles  in  a  zone  of  low  Q.  (d)  Dif¬ 
ference  between  the  velocity  of  the  impurity  and  the  fluid  velocity  at  the  instantaneous 
impurity  position. 
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Figure  10:  The  upper  panels  show  the  Poincare  sections  of  a  fluid  parcel  trajectory  (a)  and 
of  a  neutrally  buoyant  impurity  with  St  =  0.2  (b)  in  a  time-dependent  cellular  flow  with 
A  =  250,  B  =  0.3,  and  uj  =  1.0.  Panel  (c)  shows  the  time  series  of  the  difference  between 
the  velocity  of  the  impurity  and  the  fluid  velocity  at  the  impurity  position. 


a 


Figure  11:  Small  neutrally  buoyant  impurities  with  Stokes  number  St  =  0.2  tend  to  con¬ 
centrate  in  the  centers  of  vortices  in  a  two-dimensional  turbulent  flow.  Panel  (a)  shows  the 
distribution  of  the  Okubo- Weiss  parameter  Q  at  time  t  =  1  non-dimensional  units  (lighter 
shading  corresponds  to  larger  Q,  that  is,  to  hyperbolic  regions).  Panel  (b)  shows  the  dis¬ 
tribution  of  neutrally  buoyant  impurities  which  were  initially  uniformly  distributed  in  the 
flow  domain. 
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Figure  12:  Distribution  at  t  =  6  non-dimensional  time  units  of  8000  heavy  impurities 
advected  by  forced,  statistically  stationary  two-dimensional  turbulence  in  a  cyclonically 
rotating  reference  frame.  Small  solid  triangles  indicate  the  positions  of  the  advected  particles 
and  thin  solid  curves  indicate  vorticity  isolines.  The  impurities  concentrate  in  the  cores  of 
anticyclonic  vortices.  Parameter  values  are  <5  =  0.8,  St  =  0.01,  and  Ro  =  1/320. 


Figure  13:  Panel  (a)  shows  the  vorticity  field  at  t  =  15  non-dimensional  time  units  for  our 
idealization  of  the  flow  in  a  protosolar  nebula.  Panel  (b)  shows  the  distribution  of  heavy 
impurities  at  the  same  time. 
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Plankton  sinking  and  turbulence 

Lecture  given  on  1  July  2010  by  A.  Provenzale 


1  Introduction 

In  still  fluid,  heavy  particles  tend  to  fall  and  reach,  after  some  time,  a  terminal  velocity 
determined  by  the  balance  between  the  pull  of  gravity  and  the  viscosity  of  the  fluid.  Often, 
plankton  cells  are  slightly  heavier  than  sea  water,  and  as  a  consequence  they  tend  to  sink 
in  the  water  column.  This  sinking  could  then  take  them  below  the  euphotic  layer,  arresting 
photosynthesis. 

To  overcome  sinking,  plankton  have  devised  various  types  of  strategies.  One  is  swim¬ 
ming,  that  is,  an  active,  energy-consuming  self-propulsion  than  can  oppose  gravitational 
sinking.  Other  phytoplankters  can  modify  their  buoyancy,  becoming,  at  least  for  some 
time,  positively  buoyant.  Others  can  exploit  the  turbulence  in  the  mixed  layer,  using  it  to 
stay  suspended  for  longer  times.  This  latter  option  was  discussed  at  length  by  Margalef  [5], 
who  associated  the  level  of  turbulence  in  the  surface  layer  with  the  type  of  organisms  that 
are  environmentally  favoured.  The  terminal  velocity  of  spherical  impurities  with  the  same 
density  grows  with  the  squared  radius  of  the  falling  impurity.  Thus,  larger  plankton  cells 
tend  to  sink  faster  than  smaller  ones,  provided  they  can  be  approximated  by  a  somewhat 
spherical  shape.  When  the  turbulence  in  the  euphotic  layer  is  low,  Margalef  argued,  smaller, 
possibly  swimming  phytoplankters  such  as  flagellates  are  favoured  as  the  larger  species  sink 
more  rapidly  out  of  the  euphotic  zone.  On  the  other  hand,  when  the  level  of  turbulence 
is  high,  larger  plankton  can  benefit  from  turbulent  suspension  and  become  favoured  with 
respect  to  the  small  ones  (which  feel  diffusion  limitation  of  nutrients  more  severely). 

One  potential  problem  with  this  view  is  that  it  is  not  clear  whether  turbulence  really 
favours  a  prolonged  suspension  of  heavy  impurities.  For  example,  the  results  of  an  experi¬ 
ment  by  Ruiz  et  al  [8]  apparently  indicate  that  turbulence  make  heavy  particles  sink  faster 
than  in  still  fluid.  However,  other  experiments  considering  different  flow  configurations  gave 
the  opposite  result,  see  for  example  [1].  A  careful  study  of  this  issue  is  thus  required. 


2  Suspension  of  heavy  impurities 


The  non-dimensional  equation  of  motion  for  a  heavy  impurity  under  the  influence  of  gravity 
is  written  as 


(TV 

dt 


St 


(V-u  +  z  W) 


(1) 
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where  V  is  the  velocity  of  the  impurity,  u  is  the  fluid  velocity,  W  is  the  terminal  velocity  of 
the  impurity  in  still  fluid,  z  points  vertically  upwards  and  St  is  the  Stokes  number  defined 


as 


St 


2 

9h 


where  6  =  Pf/pp  is  the  ratio  between  the  density  of  the  fluid  and  the  density  of  the  impurity, 
L  is  a  typical  lengthscale  of  the  flow,  U  is  a  characteristic  flow  velocity  and  Re  the  Reynolds 
number.  The  terminal  velocity  W  is  found  by  equating  the  gravitational  force  to  the  Stokes 
drag,  and  its  non-dimensional  version  is  given  by 


2.1  Permanent  suspension  in  a  cellular  flow  field 

Stommel  [10]  studied  the  vertical  settling  of  small  heavy  impurities  in  a  simple  flow  configu¬ 
ration  composed  by  a  network  of  vortices  with  horizontal  axis;  all  the  dynamics  takes  plaxe 
in  the  vertical  plane  (x,  z)  and  the  vortices  are  infinitely  long  in  the  y  direction.  Stommel 
neglected  the  pressure  term  SDu/Dt  (which  we  shall  also  do  in  the  rest  of  this  section) 
and  neglected  also  the  inertia  of  the  impurities,  imposing  dV/dt  =  0  in  equation  (1),  which 
gives 

V  =  u  —  zW.  (3) 

Defining  the  two-dimensional  stationary  streamfunction 

T  =  2  (cos  x  +  cos  z) .  (4) 


The  flow  held  is  given  by 

u  =  ( u ,  vj) 

and  the  velocity  of  the  impurity  is 


/  \ 
\  dz'  dx  ) 


V  =  ( u ,  w  -  W) 


_d^_ 

dz  '  dx 


f  d^>  d^> 

y  dz'  dx 


where  —  Wx.  The  solution  is  sketched  in  Figure  1.  The  how  is  characterized 

by  regions  with  closed  streamlines  of  \k,  surrounded  by  jet-like  channels  with  essentially 
downward  velocities,  owing  to  the  effects  of  gravity.  The  impurities  which  end  up  in  the 
downward  jets  fall  faster  than  in  still  fluid,  but  some  impurities  get  trapped  in  the  cells  with 
closed  streamlines  and  remain  suspended  forever.  Stommel  [10]  thus  argued  that  permanent 
suspension  is  possible  also  for  heavy  particles  and  that  turbulence  (or  cellular  how  helds) 
can  keep  heavy  impurities  afloat.  An  extension  of  this  approach  to  the  case  of  a  time- varying 
stream  function  was  discussed  in  [9]. 
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Figure  1:  The  motion  of  passively  advected  heavy  impurities  in  the  cellular  flow  field  con¬ 
sidered  by  Stommel  [10]. 


(a)  Circular  cell 


Figure  2:  (a)  Heavy  impurities  in  an  array  of  circular  flow  cells  drift  away  from  the  center 
owing  to  inertial  effects,  (b)  Impurities  in  a  (frozen)  flow  field  with  more  complex  streamlines 
can  stay  suspended  forever  (from  [7]). 


2.2  The  role  of  inertia 

According  to  Maxey  and  Corrsin  [6]  permanent  suspension  is  actually  not  possible  when 
particle  inertia  is  not  neglected.  Owing  to  the  centrifugal  force  acting  on  a  particle,  in  their 
motion  in  the  circulation  cells  the  impurities  will  spiral  out,  away  from  the  cell  center,  as 
illustrated  in  Figure  2a.  This  has  later  become  known  as  the  sling  effect:  heavy  particles 
tend  to  be  ejected  from  high  vorticity  regions  and  concentrate  in  high  strain  regions  be¬ 
tween  the  flow  cells;  closed  orbits  no  longer  exist  and  heavy  particles  eventually  end  up  in 
the  downdrafts  between  the  cells,  causing  an  increase  of  the  mean  settling  rate  compared  to 
what  happens  in  still  fluid,  in  keeping  with  the  results  of  the  experiment  mentioned  above  [8]. 

However,  consideration  of  flows  with  a  more  complex  streamline  pattern,  such  as  that 
depicted  in  figure  2b,  can  change  the  picture.  The  work  of  [7]  considered  a  spatially  ran¬ 
dom,  two-dimensional,  incompressible  stationary  flow  with  energy  spectrum  E(k)  oc  A;-'5/3. 
In  this  flow,  impurities  are  now  pushed  by  their  own  inertia  outwards  as  well  as  inwards, 
depending  on  the  streamline  curvature.  In  turn,  this  can  lead  to  the  emergence  of  closed 
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trajectories,  which  corresponds  to  a  limit  cycle  in  the  four-dimensional  impurity  phase  space 
(X,Y,Vx,Vy). 

Thus,  the  study  of  the  motion  of  small,  heavy,  spherical  particles  in  a  random,  station¬ 
ary  two-dimensional  flow  shows  that  permanent  particle  suspension  is  possible  when  the 
randomness  suitably  alters  the  curvature  of  the  streamlines  around  an  eddy.  A  stationary 
random  two-dimensional  flow  will  have  two  distinct  groups  of  impurities,  as  shown  in  figure 
3a.  The  particles  in  one  group  will  stay  very  close  to  where  they  started  (that  is,  they  stay 
suspended),  moving  on  limit  cycles  with  closed  trajectories.  The  impurities  in  the  other 
group  move  away  from  where  they  started,  reach  the  downdrafts  and  sink  with  an  average 
settling  speed  which  is  larger  than  their  terminal  velocity  in  still  fluid. 

Owing  to  inertial  effects,  identical  particles  can  thus  have  significantly  different  behav¬ 
ior  when  moving  in  a  random  steady  flow.  Some  particles  are  swept  into  the  downdrafts 
between  the  eddies  and  settle  at  rates  that  are  larger  than  in  still  fluid.  Other  particles 
remain  suspended,  moving  on  closed  trajectories,  so  that  their  vertical  motions  are  oscil¬ 
latory.  These  two  different  types  of  behavior  result  in  a  bimodal  distribution  of  vertical 
displacements.  As  a  result,  the  mean  settling  velocity  can  be  either  larger  or  smaller  than 
the  terminal  velocity  in  still  fluid  depending  on  the  fraction  of  suspended  particles. 

For  a  time- varying  random  two-dimensional  flow,  the  distinction  between  the  population 
of  suspended  impurities  and  that  of  rapidly  falling  ones  does  not  remain  so  strict  as  particles 
can  move  from  one  group  to  the  other.  In  this  case,  permanent  suspension  is  no  longer 
possible,  but  suspension  can  still  last  for  a  long  time.  Overall,  these  results  confirm  the  view 
of  Stommel  and  Margalef,  who  claimed  that  turbulence  could  indeed  help  heavy  impurities 
to  stay  suspended  for  longer  times  than  in  still  fluid. 

3  Phytoplankton  competition  in  turbulent  waters 

As  turbulence  affects  the  sinking  rate  of  phytoplankton,  one  may  go  back  to  the  old  Mar¬ 
galef  question  and  ask  how  populations  of  phytoplankton  species  with  different  size  respond 
to  a  turbulent  environment.  Larger  plankton  species  seem  to  be  favoured  in  conditions  of 
strong  turbulent  suspension  while  smaller  phytoplankton  can  thrive  in  quieter  conditions. 

To  explore  this  issue,  we  follow  [3]  and  consider  two  phytoplankton  species  with  radius 
ai  and  a2  that  compete  for  the  same  nutrient,  N.  The  population  dynamics  is  described 
by  the  following  (non-dimensional)  NP  system: 

DN  N  Pt  NPo 

=1  -  — — Q(ai)  -  f3(a2)  —  ~NQ(a2)  +  n ntu  [PiQ(ai)  +  P2Q(a2 )] 

~  im  +  s(ai)]pi  (5) 

DP2  al  .  np2 
dF  =l3MkTN  - [m  +  5(02)1  P2' 
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(a)  Frozen  flow  field 


AY(Lo) 

(b)  Time-evolving  flow  field 


Figure  3:  (a)  In  a  frozen  turbulent  flow  some  impurities  can  stay  suspended  forever,  while 
others  move  away  from  their  initial  position  and  fall  faster  than  in  still  fluid.  As  a  result, 
the  distribution  of  particle  displacements  is  bimodal.  The  three  curves  show  the  probability 
distributions  of  the  vertical  displacement  of  falling  impurities  at  different  times  and  the 
arrows  indicate  the  displacement  that  the  impurities  would  have  had  if  they  moved  at  the 
Stokes  settling  velocity  W.  On  the  left  of  the  plot,  there  are  particles  which  stay  suspended 
forever  and  their  displacement  from  the  initial  position  does  not  grow  with  time,  (b)  In 
a  time-evolving  random  field  there  is  no  strict  separation  between  suspended  and  falling 
particles  and  permanent  suspension  is  not  possible,  but  a  large  fraction  of  the  impurities 
can  fall  more  slowly  than  in  still  fluid.  From  [7]. 


where  Pi  and  P2  are  the  concentrations  of  the  two  species  of  phytoplankton,  f3  is  a  growth 
rate,  k  is  a  limiting  term  for  phytoplankton  growth,  m  is  a  size-independent  natural  phy¬ 
toplankton  mortality  rate,  s  is  the  size-dependent  sinking  rate,  and  Q  is  the  nutrient  quota 
of  phytoplankton,  used  to  transform  biomass  into  nutrient  amount. 

The  terms  in  the  nutrient  equation  of  system  (5)  describe  two  sources  of  nutrients:  a 
constant  external  nutrient  source,  /,  and  the  nutrients  regenerated  from  dead  phytoplank¬ 
ton,  as  described  by  the  recycling  term  [Pi  Q(a\  )  +  FbQ^)]  with  //jv  <  1-  Nutrients 
are  consumed  by  phytoplankton  at  a  rate  (3 ,  describing  the  phytoplankton  efficiency  to  up¬ 
take  nutrient  which  is  assumed  to  be  a  function  of  phytoplankton  size. 


In  their  study,  Jiang  et  al  [3]  recalled  that  there  is  a  complex  relationship  between  the 
efficiency/growth  rate  /?  and  the  size  of  phytoplankton  species.  Observations  show  that  the 
growth  rate  of  relatively  large  phytoplankton  species  decreases  with  increasing  size  while 
the  opposite  tends  to  be  true  for  small  phytoplankton  species.  This  suggests  that  there 
should  be  an  optimal  phytoplankton  size  at  which  the  growth  rate  is  maximized.  A  simple 
analytic  form  of  the  phytoplankton  growth  rate  that  has  these  characteristics  is 


/3(a) 


a 

aq  a2  +  0:2a  +  0:3 


(6) 


where  aq,  0:2,  and  0:3  are  positive  constants.  The  form  of  the  growth  rate  described  in  (6) 
is  only  meant  to  capture  the  qualitative  structure  of  observed  phytoplankton  growth  rates 
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Figure  4:  Outcome  of  species  competition  in  a  turbulent  environment,  parameterized  by 
the  values  of  the  exponent  a  and  the  coefficient  so-  Red  (in  the  upper  left  area)  indicates 
dominance  of  the  larger  species  with  a  =  1.4,  blue  indicates  dominance  of  the  smaller  species 
with  a  =  0.7.  Cell  radii  are  in  non-dimensional  units. 

and  is  not  necessarily  universal. 

The  growth  of  phytoplankton  populations,  thanks  to  nutrient  consumption,  is  balanced 
by  plankton  mortality,  m,  and  plankton  sinking,  s(a).  In  the  phytoplankton  competition 
model  (5),  the  mortality  rate  rn  is  assumed  to  be  independent  of  phytoplankton  size,  un¬ 
like  the  sinking  term.  In  the  absence  of  turbulence  and  assuming  spherical  particles,  the 
settling  velocity  of  an  individual  phytoplankter  can  be  derived  from  Stokes  law  obtaining 
s(a )  =  so®2;  this  form  was  assumed  in  [3]  with  so  =  2.  With  this  choice,  it  was  found  that 
the  most  favoured  species  is  always  the  smaller  one,  and  that  there  should  be  a  natural 
tendency  towards  dominance  of  smaller  phytoplankton  species.  In  this  view,  the  higher  ef¬ 
ficiency  in  nutrient  uptake  of  the  larger  species  is  not  enough  to  balance  their  faster  sinking 
rate,  leading  to  a  dominance  of  the  smaller  organisms. 

In  reality,  the  sinking  rate  depends  on  the  level  of  turbulence,  which  can  alter  the  above 
result.  In  a  subsequent  study,  Demonte  et  al  [2]  kept  all  parameter  values  as  in  [3]  but 
assumed  the  sinking  rate  to  be  proportional  to  a  power  of  the  phytoplankton  cell  radius,  o, 
through  a  simple  relationship 


s{a)  =  s0aa  (7) 

where  so  is  the  coefficient  and  a  is  the  exponent.  The  motivation  of  this  choice  is  that 
turbulence  can  alter  the  simple  dependence  of  sinking  velocity  on  radius  observed  in  still 
fluid.  What  is  found  in  this  case  is  that  the  outcome  of  species  competition  depends 
on  the  properties  of  turbulence,  that  is,  on  the  assumed  values  of  sq  and  a.  Figure  4 
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shows  the  outcome  of  competition  for  two  species  with  (non-dimensional)  radius  0.7  and 
1.4,  respectively.  For  the  parameter  values  chosen  in  [3],  the  smaller  species  is  favoured. 
However,  by  varying  the  values  of  a  and/or  so,  one  can  get  the  opposite  result.  Even  by 
keeping  a  =  2  but  varying  the  coefficient  so,  one  can  obtain  dominance  of  the  larger  species. 
This  is  consistent  with  the  view  that  the  outcome  of  competition  of  phytoplankton  species 
with  different  size  depends  on  the  properties  of  the  turbulent  environment,  and  it  seems  to 
further  confirm  Margalef’s  view. 

4  Vertical  velocities  in  mesoscale  flows 

In  the  previous  sections  we  have  seen  that  turbulent  suspension  can  be  an  important  mech¬ 
anism  in  plankton  dynamics.  We  can  ask  whether  these  effects  are  visible  also  on  scales 
larger  than  those  of  homogeneous  and  isotropic  turbulence:  for  example,  whether  they  can 
play  a  role  at  the  ocean  mesoscales  where  fronts,  vortices  and  waves  dominate  the  flow,  as 
discussed  in  the  first  lecture. 

Although  a  final  answer  is  still  missing,  an  important  point  concerns  the  structure  and 
intensity  of  the  vertical  velocity  field  at  the  mesoscales.  If  there  are  strong  and  organized 
vertical  velocities,  then  these  can  also  induce  suspension  (in  upwelling  regions)  and  rapid 
sinking  (in  downwelling  areas),  thus  affecting  plankton  dynamics,  besides  their  role  in  de¬ 
termining  the  nutrient  input  to  the  euphotic  layer. 

In  proximity  of  fronts,  vertical  velocities  can  be  very  intense,  creating  a  complex  pattern 
of  upwelling  and  downwelling.  However,  other  mesoscale  structures  can  also  play  a  role:  co¬ 
herent  vortices,  in  particular,  have  been  shown  to  be  associated  with  a  complicated  pattern 
of  strong  vertical  velocities  [4].  In  their  study,  Koszalka  et  al.  have  considered  a  primitive 
equation  model,  using  the  ROMS  ocean  model,  with  doubly  periodic  boundary  conditions 
and  idealized  surface  forcing.  The  simulations  show  the  emergence  of  coherent  vortices, 
similarly  to  what  happens  in  two-dimensional  or  quasi-geostrophic  (QG)  turbulence.  The 
horizontal  dynamics  and  transport  properties  of  this  flow  are  not  too  dissimilar  from  what 
happens  in  QG  turbulence,  suggesting  that  approaches  based  on  simple  QG  models  capture 
a  relevant  portion  of  the  horizontal  dynamics.  However,  the  vertical  velocities  associated 
with  the  vortices  in  the  primitive  equation  setting  can  be  very  strong,  at  variance  with  the 
situation  found  for  QG  vortices. 

Figure  5a  shows  a  slice  of  the  vorticity  field  in  one  of  the  primitive  equation  simulations 
discussed  in  [4],  and  figure  5b  shows  the  vertical  velocity  field  associated  with  one  of  the 
vortices.  The  vertical  valocities  in  proximity  of  the  vortex  are  very  intense,  and  can  lead 
to  various  consequences  for  the  marine  ecosystem,  ranging  from  enahnced  nutrient  input 
to  the  suspension  of  phytoplankters  in  the  upwelling  areas  and  their  rapid  sinking  in  the 
downwelling  regions. 
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Figure  5:  Panel  (a):  The  surface  vorticity  field  produced  by  a  numerical  simulation  of  forced- 
dissipated  primitive  equations  with  doubly  periodic  boundary  conditions  in  a  domain  with 
lateral  size  256  km  and  depth  1000  meters.  Forcing  is  provided  by  idealized  surface  (wind) 
forcing  and  relaxation  to  a  given  density  profile.  The  relative  vorticity,  C,  is  normalized  by 
the  Coriolis  parameter,  /.  Panel  (b):  The  vertical  velocity  field,  in  m/day,  at  a  depth  of  78 
meters,  for  the  vortex  indicated  by  the  white  segment  in  panel  (a).  From  [4], 

5  Conclusion 

In  these  lectures  I  tried  to  illustrate,  by  various  examples,  how  turbulence  affects  aquatic 
organisms.  The  overall  picture  shows  that  turbulence  is  an  important  component  of  these 
ecosystems  and  it  must  be  properly  considered  to  obtain  a  quantitative  description  of  ecosys¬ 
tem  functioning.  From  the  small  scales  of  homogeneous  and  isotropic  turbulence,  where 
individual  plankton  cells  swim,  sink  and  are  transported  by  the  fluid  flow,  to  the  ocean 
mesoscales  where  vortices,  fronts  and  waves,  with  their  associated  upwelling  and  down- 
welling  regions  determine  the  plankton  distribution,  to  the  large  scales  where  the  basin 
circulation  controls  the  overall  structure  of  the  marine  ecosystem,  fluid  dynamics  is  at  the 
core  of  the  biological  processes  occurring  in  aquatic  environments.  There  are  still  many 
open  questions,  some  of  which  can  possibly  be  addressed  by  the  methods  of  GFD  and  of 
dynamical  system  analysis  as  illustrated  here. 
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Evolutionary  Models:  Movement  and  Mixing  in  Trait  and 
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Over  generations,  a  species  will  evolve  to  increase  its  fitness  to  its  environment.  Natural 
selection  is  the  primary  mechanism  by  which  this  occurs,  wherein  environmental  pressures 
cause  individuals  with  more  fit  phenotypes  to  reproduce  at  a  higher  rate.  For  the  purpose 
of  modeling,  the  space  of  possible  heritable  traits  may  be  viewed  as  a  continuum,  and  a 
particular  individual  as  inhabiting  a  point  in  trait  space.  Making  the  biomass  continuous  as 
well,  a  population  is  viewed  as  a  distribution  in  trait  space.  In  this  view,  natural  selection  is 
the  process  by  which  the  distribution  of  biomass  propagates  through  trait  space,  generally 
increasing  its  average  fitness. 

1  Building  blocks  for  models 

The  first  step  in  modeling  a  system  in  which  the  dominant  traits  can  change  is  defining  a 
suitable  measure  of  how  much  of  a  species  there  is.  While  the  actual  number  of  members  of 
the  species  is  an  option,  a  more  useful  measure  is  the  biomass,  b,  which  readily  translates 
across  species.  The  biomass  density  can  be  treated  as  a  function  over  space,  time,  and  this 
“trait-space” 

fr(x,  t\s\,  S2,  •••) 

where  the  variables  s  =  {si,  S2,  •  •  •}  specify  the  phenotype,  or  traits,  of  the  organism.  In 
addition  to  the  biomass,  the  rates  of  survival  and  reproduction  depend  on  the  environmental 
conditions  and  vary  with  phenotype.  These  rates  are  defined  as 

n  =  n(s\E)  ,  n{Sl\E)  y  tz(s2\e) 

where  E  represents  the  environmental  conditions.  The  biomass  evolution  is  related  to  the 
growth  rate  (including  reproduction)  by 

^b(s)  =  K(s\E)b(s)...  (1) 

indicating  that  new  organisms  (represented  as  biomass)  have  the  same  values  of  s  as  their 
parents.  However,  in  nature  genetic  mutation  can  occur  causing  the  offspring  of  parents  to 
have  a  slightly  different  trait  values.  (And,  since  these  represent  phenotypes,  there  is  some 
degree  of  natural  variation  as  well.)  This  can  be  modeled  by  changing  Equation  1  to 

-^-b(s)  =  K(s\E)b(s)  +  V2smb  (2) 
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where  m  is  a  type  of  trait  diffusion  (which  can  be  proportional  to  the  reproduction  rate). 
The  rate  appears  within  the  derivatives  for  the  same  reasons  it  does  in  the  Fokker-Planck 
equation  (or  the  representation  of  kinesis  in  a  previous  lecture)  -  the  length  of  the  random 
jump  in  trait  space  is  set  at  the  beginning  of  the  excursion  (and  there  is  no  reason  to  expect 
non-divergence).  In  any  case,  we  shall  just  use  the  over-simplification  of  treating  m  as 
constant. 

The  simplified  model  created  in  Equation  2  obviously  has  a  number  of  flaws.  If  a  trait 
is  not  represented  in  the  initial  modeling  it  will  never  arise  long  term  meaning  that  a  full 
trait  space  needs  to  be  known  a  priori.  Additionally,  the  time  scales  are  not  well-defined. 
While  there  is  some  idea  of  the  rate  of  genetic  changes,  not  much  about  how  this  translates 
into  alterations  of  the  phenotype  is  understood.  Finally,  since  natural  selection  acts  on  an 
individual  basis;  a  model  based  on  biomass  may  not  accurately  model  the  dynamics  for  low 
values  where  extinction  is  probable. 

If  we  extend  the  above  model  to  start  accounting  for  advection  and  diffusion  in  space  as 
well  as  diffusion  in  trait  space,  we  can  use  the  following  equation  to  model  a  wide  variety 
of  systems 

-  VkV6  =  bU{s\E)  +  V2smb. 

With  this  governing  equation  the  following  topics  will  be  examined 

•  Evolutionarily  stable  strategies:  what  s  values  are  selected  for? 

•  Relationship  to  stability  theory 

•  Adaptive  dynamics  in  the  presence  of  mutation 

•  Effects  of  mixing,  diffusion,  and  advection 


2  A  Simple  Model 


To  make  a  model  which  can  be  studied  easily,  it  is  necessary  to  create  a  function  for 
the  growth  rate.  For  a  first  model,  we  make  a  number  of  simplifying  assumptions:  the 
organisms  will  have  a  ID  trait  space,  s  =  s,  and  use  a  common  resource,  N,  so  that  the 
amount  available  per  individual  is 


NI  =  N 


_ bo _ 

I  ds  b(s) 


where  &o  corresponds  to  the  biomass  of  a  single  individual  and  the  notation  (b)  =  f  ds  b 
has  been  used.  Growth  will  occur  when  Nj  >  Ncra  but  saturates  to  a  maximum  value  gm 
when  the  resources  are  plentiful.  For  example, 


9  —  9m 


A Ti  -  Ncrit 
Nj 


for  Nj  >  Ncrit, 


which  results  in  a  logistic-like  system: 


9m 


,  d2 

b  +  m,-—r;b 
osz 


(3) 
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The  coefficients  gm  (maximum  growth  rate),  d  (death  rate),  and  bc  =  Nbo/Ncrit  (carrying 
capacity)  are  functions  of  s.  For  this  form, 

K(s\E)  =  K(s\(b))  =  gm  (f  -  ■£)  -  d. 

The  environment  here  is  the  density  of  competitors  for  the  resource  as  well  as  the  value  of 
N  implicit  in  the  carrying  capacity  bc. 


3  Evolutionarily  Stable  Strategies  (ESS) 


If  the  system  modeled  by  Equation  3  is  analyzed  without  considering  mutation  (m  =  0), 
there  are  many  possible  singular  solutions  of  the  form 


b(s,  t )  =  b(t)S(s  —  s ) 


with  b  satisfying 


p  =  K(s\b)b 


since  E  =  (b)  =  b.  This  results  in  just  the  logistic  equation  and  steady  solutions  where 
TZ(s\b)  =  0  indicating 

d{s) 


b  bc(s)  1  - 


9m.{s , 


Let  us  introduce  a  different  organism  with  phenotype  s':  b  =  b(t)S(s  —  s)  +  b'(t)S(s  —  s'); 
if  its  biomass  is  very  low  when  compared  to  b.  Equation  3  becomes 


-  nm  = 


9m(s')  (  1  - 


bc(s') 


-  d(s') 


and  when  the  coefficients  are  time- independent,  we  can  write  this  in  the  simpler  form 


ld_ 

br  dt 


b' 


9m(s') 


b(s')  -  b(s) 

bc(s') 


where  b(s')  is  the  equilibrium  population  for  s  =  s'.  For  this  model,  the  population  with  the 
highest  equilibrium  value  will  exclude  all  others:  b'  will  decay  if  b(s')  <  b(s).  Alternatively, 
the  ESS  maximizes  bc(s)[gm(s)  —  d(s)\/gm(s),  and  it’s  also  the  one  which  can  survive  on 
the  minimum  resource  Nj. 

In  other  situations,  there  may  be  no  single  species  which  can  out-compete  all  others. 
For  example,  consider  a  case  in  which  different  species  use  somewhat  different  resources  so 
that  Nj  for  a  species  s  depends  on  the  nearby  biomass  but  not  species  far  away  in  trait 
space.  E.g,  is  s  represents  the  size  of  a  herbivore,  it  will  generally  feed  on  a  limited  range 
of  plant  sizes.  This  kind  of  local  competition  could  look  like 


E  —  gm 


ds'w(s\s')b(s') 


d 


with  w  a  peaked  function.  Since  we  will  take  bc  to  be  constant,  we  can  just  choose  it  to  be 
one  hereafter;  w  will  be  normalized  to  have  unit  integral.  Figure  1  shows  an  example  with 
speciation  events  and  a  final  state  with  6  “species.” 
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Figure  1:  Biomass  density  vs.  time  for  the  case  of  local  competition.  The  y  axis  is  s;  the 
x  axis  is  time,  but  on  an  uneven  scale.  The  black  lines  show  where  the  time  scale  changes 
from  1  day  per  unit  to  5,  10,  50,  and  500  days. 

4  Adaptive  dynamics 

Dieckmann  points  out  that  the  dynamics  may  be  approximated  by  equations  for  the  net 
population  and  the  rate  of  movement  in  trait  space. 

We  can  look  at  the  (local)  mean  biomass  and  the  mean  value  of  s 

( b )  =  fsodsb(s ) 

<->  =  (sb)/(b) 

Applying  the  local  average  to  a  system  defined  in  Equation  2  and  ignoring  fluxes  out  of  the 
range  by  mutation,  we  get 

^L(b)  =  (n(s,b)b) 

which  is 

-K{(s),b)(b)  +  ^K"a2(b) 

from  Taylor  expansion.  For  evaluating  the  competition/  environmental  effects,  we  can  treat 
the  biomass  distribution  as  a  set  of  singular  values 

b  =  ( b)i5(s  -  ( s)i ) 
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Taking  the  local  s  moment  and  expanding  gives 


k{s>(6)  =  (sKb) 

=  ((» -  <‘)m 

~  IZ'a2  (b) 


The  speed  at  which  the  peaks  migrate  towards  the  optimal  value  depends  on  the  gradient 
and  on  the  width  of  the  distribution  a.  Spread  arises  from  mutation  or  from  migration 
mixing.  We  could  derive  an  equation  for  <r;  it  tends  to  increase  because  of  mn  and  decrease 
because  of  negative  curvature  in  7Z  which  acts  to  sharpen  peaks. 

As  a  simple,  and  analytically  tractable  example  for  a  single  peak,  consider  a  system 
with 


-R2(s  -  s0) 


2 


(b) 


where  Ro,  R2,  and  so  could  be  functions  of  time  as  the  population  or  other  processes  alter 
the  resource.  We  can  find  a  time-dependent  solution  (without  needing  to  approximate) 


b  =  bo(t)  exp 


1  [s  —  x(t )]2 

2  u2(f) 


when 


/  d 

rd—a  = 
dt 


1„*  q  m 

-R*crs  H - 

2  a 


d_ 

dt 


X  =  —R2<JZ(x  —  So) 


1  d  _  1  2  m 

--b0-R0--R2(x-s0)  -  — 


with  R*  =  Rj(  1  —  abo). 

The  center  of  the  distribution  moves  towards  the  peak  in  the  growth  rate  curve  at  a  speed 
which  depends  of  the  slope  at  the  current  center  location  [— R%{x  —  so)]  and  the  variance 
of  the  distribution.  The  width  asymptotes  to  the  value  {2m/ i?^)1/4  (if  1Z"  is  negative) 
which  narrows  as  the  growth  rate  curve  becomes  more  sharply  peaked.  The  population  will 
increase  or  decrease  depending  on  the  sign  of  Rq  —  m/a2',  the  dynamics  of  the  resource  will 
adjust  Rq  until  it  reaches  the  equilibrium  value. 

The  environment  will  be  time-dependent  because  of  many  external  factors,  so  that  1Z 
itself  has  a  whole  spectrum  of  variability.  The  velocity  for  movement  in  trait  space  will 
therefore  be  fluctuating,  and  we  cannot  expect  the  system  to  be  in  equilibrium.  If  the 
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external  changes  are  very  slow  compared  to  mutation  times,  the  system  will  be  very  close 
to  the  ESS  for  current  conditions: 


dlZ(s,  t ) 


dsj 


s=s(t) 


~  0 


In  contrast,  if  the  environmental  fluctuations  are  fast,  the  center  of  the  distribution  will  not 
be  able  to  keep  up,  and  the  mean  trait  value  will  settle  at  the  point  where  the  time-averaged 
velocity  is  zero 


dlZ(s,  t ) 


dsj 


~  0 


For  intermediate  time-scales,  comparable  to  mutation  times,  the  center  of  the  distribution 
will  partially,  but  not  completely,  track  the  variations.  The  population  dynamics  is  implicit 
here,  basically  maintaining  as  we  shall  see,  it  may  play  a  much  more  active  role 

with  multiple  trophic  levels. 


5  Time-dependent 

We  can  also  consider  a  system  distributed  in  trait-  and  physical  space  in  which  a  changing 
environment  produces  time-dependent  forcing.  A  simple  way  to  do  this  is  to  change  the 
reproduction  rate  in  such  a  way  that  favors  different  phenotypes  at  different  times.  This 
could  model  such  changing  environmental  parameters  as  mixed  layer  depth,  or  sunlight, 
for  example.  During  the  portion  of  the  forcing  cycle  when  a  certain  phenotype  is  not  fit, 
mutation  can  replenish  such  populations  which  would  otherwise  slowly  die  out.  We  can 
then  see  a  seasonal  cycle  in  the  community  structure  as  well  as  the  biomass.  Alternatively, 
selective  grazing  in  which  the  zooplankton  feed  more  heavily  on  the  more  abundant  species 
can  also  prevent  the  non-optimal  species  from  disappearing. 

If  the  growth  rate  is  time-dependent,  ^  0,  a  time-average  of  the  biomass  can  be 
taken  to  find 

f  ln17W)  =  =  R(s'\s) 

Then,  based  on  this  result,  s  will  represent  an  evolutionarily  stable  strategy  if 

i?(s|s)  =  0  [definition]  and  R(s'\s)  <0  for  s'  /  s 

which  indicates  that  organisms  with  a  non-optimal  phenotype  will  die  out  representing 
natural  selection.  Locally,  these  conditions  become 

n  n2 

-^R(s'\s)\s,=.  =  0  ,  -^R(s'\s)\s,=-<0, 

but  since  the  competition  is  temporally  local,  it  is  possible  that  several  species  could  coexist 
with  one  or  another  dominant  at  different  times. 
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Figure  2:  Biomass  density  vs.  time  for  s  =  0.41  (late)  and  for  the  case  of  local  competition. 
The  y  axis  is  s;  the  x  axis  is  time,  but  on  an  uneven  scale.  The  black  lines  show  where  the 
time  scale  changes  from  1  day  per  unit  to  5,  10,  50,  and  500  days. 
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6  Relationship  to  stability  theory 

To  understand  the  relationship  to  stability  theory,  look  again  at  the  linearized  model 


with  the  basic  state  having 


|6'  =  K(,|5)6'  +  5||W 


b(s)'R.(s\b(s))  =  0 


Standard  problems:  b(s)  /  0  =>■  7 Z(s,b(s))  =  0,  the  first  term  vanishes,  and  we  deal 
with  the  second  term,  often  in  the  form 


d  ,  _  r  dnt 

dtbi  - bi  db, 


6' 


b=b 


Evolutionarily  Stable  Strategies:  b(s)  =  0  for  s  /  s  so  that  lZ(s\b)  ^  0  and  the  first 
term  is  the  important  one. 


In  the  case  with  mutation 


b1Z(s\b)  +  \72sm.b  =  0 


^b'  =  n{s\b)b'  +  b^(b')  +  V2smb' 
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both  terms  enter.  Near  the  “hot-spots”  where  b  is  large,  1Z  is  relatively  small,  the  second 
term  dominates  and  the  problem  looks  like  a  standard  stability  system  modified  by  diffusion. 
Far  away,  however,  b  is  small,  and  the  basic  state  has  a  decaying  form  with  1Z  nonzero.  If 
1Z  changes  and  becomes  positive  in  that  region,  the  population  can  “tunnel”  into  the  local 
maximum  and  grow.  The  amount  of  time  this  will  take  depends  on  distance  and  the 
mutation  rate. 


7  Physical  mixing 

We  now  consider  a  model  with  a  single  spacial  dimension,  y,  in  addition  to  the  single  phe¬ 
notype  dimension,  s.  Our  growth  rate  varies  in  physical  space,  and  the  biomass  also  diffuses 
in  physical  space,  but  we  neglect  the  diffusion  in  trait  space  that  represents  mutation. 

§ib  =  bTZ(s,y\(b))  +  K-^b, 

with 

n=[Ro-  \R2(s  -  S0(y))2]  [1  -  ( b )]  -  d. 

Figure  3  shows  the  biomass  concentrates  into  a  limited  number  of  species  occupying  over¬ 
lapping  latitude  ranges;  intermediate  species  are  excluded. 


b 


S 


Figure  3:  Biomass  density  b(s)  vs.  y  (non-dimensional)  with  physical  mixing. 

Because  there  is  no  mutation,  we  can  analyze  this  behavior  by  examining  singular  solu¬ 
tions  with  the  single-phenotype  ansatz, 

b(s,y,t)  =  b(y,  t)5(s  -  s). 
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Applying  this  ansatz  yields  the  PDE  in  y  and  t  that  governs  6, 


B  -  B2 

—b  =  ([R0  -  \R2(s  -  s0(y))2]  [1-5]  -d)b  +  K-^b. 

Note  that  the  linear  part  of  this  operator  is  a  Schrodinger  equation, 


B  -  -  B'2  - 

-h  =  r(y)b  +  *—2b, 


which  has  a  trapped  mode  if  the  region  where  lZ(s,y  |0)  >  0  is  big  enough  and  strong  enough 
to  overcome  diffusive  losses  into  the  region  where  the  death  rate  dominates  (Kierstead  and 
Slobodkin). 

To  determine  whether  a  steady  single-phenotype  solution  is  linearly  stable  to  nearby 
species  (i.e.  is  a  local  ESS),  consider  small  perturbations,  b(s,y,t)  =  b(y)  +  b'(s  +  s',y,t ). 
To  linear  order  in  s'  and  b' ,  the  perturbations  are  governed  by 


_  BTZ  -  B2 
b'lZ(s,  y,  b )  +  s'b—(s,  y,  b)  +  Kg^2b'- 


If  s  is  a  local  ESS,  the  growth  rate  for  b'  will  be  zero,  and  we  would  need  one  higher  order 
is  s'  to  decide  if  the  state  is  a  local  minimum  or  maximum.  Thus,  we  want  to  find  the  s 
such  that 

_  BTZ  -  B2 

0  =  b'U(s,  y ,  b )  +  s'b—(s,  y ,  b)  +  b' ■ 

in  the  limit  as  s'  —>  0.  Multiplying  this  equation  by  b  and  the  equation  for  the  latter  by  b' 
and  integrating  over  space  implies 


,  -  , ,  BTZ  .  — . 

dx  bb  — ( s,y,b )  =  0 


In  the  limit,  b'  — >  b ,  so  we  can  pick  an  s,  find  6,  evaluate  this  integral,  and  search  for  the 
s  where  it  vanishes.  Species  near  this  value  will  be  excluded;  however,  the  growth  rate  for 
ones  far  away,  found  from 

B2 

ab'  =  7 Z(s,  y,  b)b'  +  n— ^b' , 

may  be  positive.  In  that  case,  another  band  will  appear.  This  can  happen  because  b  varies 
spatially,  so  it  is  not  effective  at  excluding  others  in  the  regions  where  b  is  small  and  is  dying 
out  but  sustained  by  diffusion  from  the  positive  growth  regions. 

Simulations  of  this  spatial-diffusion  model  give  the  following  qualitative  results: 

•  The  single  species  solution  has  reduced  amplitude  and  will  die  out  if  k  is  big  and  the 
volume  average  of  72.(s|0)  is  negative. 

•  If  b(s,  y)  is  large  enough,  single  solutions  may  damp  the  rest  of  the  held,  even  when 
it  would  otherwise  grow. 

•  Discrete  species  with  different  phenotypes  emerge  in  spatially-separated  bands. 
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8  Predator-prey 


We  now  present  a  model  with  a  predator  species  (zooplankton,  Z)  and  a  prey  species  (phy¬ 
toplankton,  P)  evolving  in  a  one-dimensional  trait  space.  The  prey  has  a  trait-dependent 
birth  rate,  the  predator  has  a  constant  death  rate,  and  the  predator  feeds  on  the  prey 
according  to  their  respective  traits,  as  governed  by  the  interaction  kernel,  G. 


D 

ITt 

D 

m 


P(s)  = 
Z(s')  = 


n(s)N  -  /  ds'  G(s  -  s')Z(s') 


P 


a  /  ds  G(s  —  s')P(s)  —  dz 


Z, 


with  ^  peaked  at  s 
be  preserved: 


0  and  G{s  —  s')  peaked  at  zero.  We  also  require  that  the  total  biomass 


N+  ds  P(s)  +  /  ds'  Z(s'). 


This  system  has  a  singular  solution, 


(PZ)  =  (P5(s),Z5(s')), 


which  is  an  ESS,  but  it  may  be  unstable  if 


|G"(0)|Z>|//(0)|iV. 


or  if 

G(0)  MO)  ’ 

so  that  the  predation  function  is  sharply  curved  and  inhibits  growth  only  in  a  small  neigh¬ 
borhood  of  the  single-phenotype  solutions.  The  figures  show  two  examples  of  P(s,  t )  and 
Z(s't)  for  weakly  and  more  strongly  unstable  situations. 


9  Advection 

Another  way  to  make  the  environment  change  periodically  is  to  make  the  time  derivative 
into  an  advective  derivative,  and  to  impose  a  flow  that  models  ocean  circulation.  In  the 
model  that  was  simulated,  the  “ocean”  is  wind-driven,  two  layers  with  warm  water  above 
the  thermocline  and  cold  water  below),  and  quasi-geostrophic.  Figure  6  shows  a  snapshot 
of  the  mean  s  value  from  such  a  model  with  a  single  nutrient  and  a  single  predator.  The 
diversity  is  highest  in  the  region  where  the  western  boundary  currents  meet  and  proceed 
offshore  as  an  eddying  jet. 
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Figure  5:  Predator-prey  biomass  densities  for  G" /G  =  25  //'/M- 
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Figure  6:  Mean  value  of  s  in  a  two-gyre,  eddying  model. 
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Finding  Lagrangian  structures 
via  the  application  of  braid  theory 


Michael  R.  Allshouse 
October  16,  2010 


1  Introduction 

The  ability  to  accurately  identify  regions  of  mixing  in  two  dimensional  systems  has  appli¬ 
cations  in  ocean  [1]  and  geophysical  systems  [5],  as  well  as  granular  flows  [3].  Over  the  last 
decade,  much  work  has  been  put  into  finding  coherent  structures  in  the  Lagrangian  frame 
([7],  [8],  and  [9]).  Most  of  these  methods  require  field  data  (e.g.  velocity  or  vorticity  fields) 
which  in  situations  like  the  ocean  are  not  readily  available.  However,  trajectories  of  tracers 
for  these  systems  can  accurately  be  measured  and  are  available  for  analysis.  This  indicates 
that  a  method  which  estimates  the  location  of  Lagrangian  structures  based  on  trajectory 
data  would  be  highly  valuable. 

The  Lagrangian  structures  we  are  looking  for  are  defined  as  regions  of  mixing  surrounded 
by  a  transport  boundary.  These  moving  regions  contain  fluid  which  mixes  with  other  fluid 
within  the  boundary  but  not  with  fluid  outside  the  region.  This  self  mixing  region  can 
move  through  the  full  system  and  can  change  shape.  The  transport  boundaries  in  the 
fluid  effectively  divide  up  the  fluid  into  regions  of  qualitatively  different  dynamics.  What 
separates  these  transport  boundaries  from  other  material  lines  in  the  fluid  is  that  the  length 
of  these  boundaries  stays  the  same  order  of  magnitude  throughout  the  time  evolution.  If 
the  material  line  perimeter  stays  of  the  same  order  of  length  then  there  is  negligible  mixing 
occurring  between  fluid  inside  and  outside  of  the  transport  boundary. 

Throughout  this  report,  a  sample  system  will  be  used  to  demonstrate  many  of  the 
features  and  concepts  presented.  The  application  of  this  system  to  two  regions  of  fluid 
is  presented  in  Section  3.3.  Figure  la  depicts  thirty  trajectories  as  well  as  two  regions  of 
interest  (blue  and  green)  in  an  initial  state.  At  a  later  time  the  position  of  the  trajectories 
and  the  regions  of  interest  are  presented  in  Figure  lb. 

While  the  boundaries  of  the  two  regions  distorts  the  trajectories  that  start  within  a 
region  remain  in  that  region  for  all  time.  An  important  observation  is  that  while  the  blue 
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(a)  (b) 

Figure  1:  (a)  Initial  state  (b)  State  after  application  of  system  described  in  Section  3.3. 
The  blue  region  is  surrounded  by  a  mixing  transport  boundary.  The  green  region  is  not. 
Float  positions  are  shown  as  red  dots. 


region  has  changed  shape,  the  perimeter  has  remained  approximately  the  same  length.  This 
indicates  that  the  boundary  to  the  blue  region  would  be  classified  as  a  mixing  transport 
boundary.  The  green  region  shows  substantial  folding  and  stretching  of  the  boundary  as 
trajectories  from  outside  the  region  try  to  mix  with  points  inside  the  region.  Thus  it  can 
be  concluded  that,  this  boundary  would  not  be  classified  as  a  mixing  transport  boundary. 

The  goal  of  this  project  is  to  develop  a  method  for  finding  Lagrangian  structures,  like 
the  boundary  of  the  blue  region  in  Figure  1,  from  trajectory  data.  We  will  use  the  property 
of  a  negligibly  growing  perimeter  as  the  differentiating  characteristic  between  transport 
boundaries  and  other  material  lines  in  the  fluid.  In  order  to  do  this,  we  have  developed  an 
algorithm  that  analyzes  the  trajectory  data  and  makes  predictions  of  where  these  structures 
are  located.  The  novelty  of  this  algorithm  is  its  reliance  on  the  tools  of  braid  theory. 

The  concept  of  using  braid  theory  for  the  analysis  of  trajectories  was  presented  by 
Thieffeault  [11]  in  the  context  of  quantifying  Lyapunov  exponents  in  mixing  two  dimensional 
fluids.  He  applies  tools  from  braid  theory  to  calculate  the  topographical  entropy  of  a  system 
which  is  a  measure  of  the  mixing  occurring  in  the  system.  The  present  research  takes  this 
work  as  a  basis  but  develops  some  of  the  concepts  in  a  direction  which  finds  the  mixing 
transport  boundaries  in  the  system. 

This  report  is  structured  such  that  a  number  of  definitions  and  an  introduction  of 
the  necessary  background  of  braid  theory  is  first  presented.  This  will  be  followed  by  a 
demonstration  that  the  tools  from  braid  theory  can  find  mixing  structures.  Next,  a  revised 
algorithm  will  be  presented  which  will  make  a  braid  theory  based  method  a  reasonable 
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option  for  analyzing  large  numbers  of  trajectories  in  search  of  transport  boundaries.  Finally, 
an  application  of  this  method  to  a  mixing  system  is  presented  with  concluding  remarks  and 
future  works. 


2  Braid  theory 

In  order  to  fully  understand  the  algorithms  presented  in  this  report,  a  number  of  definitions 
and  tools  from  braid  theory  will  be  provided  here.  These  definitions  will  be  given  in  the 
context  of  applying  this  theory  to  a  set  of  trajectories  evolving  in  time  on  a  two  dimensional 
plane.  An  example  set  of  trajectories  for  three  floats  are  shown  in  Figure  2a  to  provide  the 
reader  with  a  visual  example  to  follow. 

The  trajectories  are  constrained  to  two  dimensions  in  the  physical  space,  and  the  position 
of  a  float  at  any  given  time  will  be  referred  to  as  a  puncture.  For  example,  there  are  three 
punctures  in  Figure  2a  and  are  represented  by  colored  dots.  The  initial  positions  of  the 
punctures  are  also  presented  in  the  figure  as  crosses. 

The  trajectories  that  the  punctures  traverse  can  be  projected  into  a  three  dimensional 
space  where  the  horizontal  plane  is  the  physical  domain  and  the  vertical  axis  is  time.  These 
three  dimensional  trajectories  are  called  strands.  Due  to  the  monotonic  nature  of  time,  a 
strand  can  only  travel  upwards  because  it  can  not  travel  backwards  in  time.  If  these  three 
dimensional  strands  are  projected  onto  the  plane  containing  the  x-axis  and  time  then  Figure 
2a  becomes  Figure  2b. 

The  collection  of  strands  make  up  what  is  called  the  braid.  The  braid  is  uniquely  defined 
by  how  the  strands  intertwine,  or  cross  in  the  projected  view.  The  strands  itself  can  be 
perturbed  and  the  braid  will  be  considered  unchanged  as  long  as  the  perturbations  do  not 
change  the  direction,  number  or  order  of  the  crossings.  The  braids  shown  in  Figure  2b  and 
2c  are  the  same  braid  even  though  the  strands  have  been  transformed  to  a  different  location 
and  over  a  different  time.  They  reason  the  two  images  depict  the  same  braid  is  because 
the  green  strand  passes  in  front  of  the  black  strand  first  then  the  black  in  front  of  the  red 
strand  in  both  cases. 

2.1  Generators 

Given  the  importance  of  the  crossing  of  strands  in  the  braid,  it  is  necessary  to  define  what  in 
braid  theory  are  called  generators,  a.  Generators  act  on  a  braid  by  taking  a  pair  of  strands 
and  crossing  them.  For  example  the  braid  shown  in  Figure  2  has  two  generators  acting  in 
a  sequence.  The  first  generator  causes  the  green  strand  to  pass  in  front  of  the  black  strand, 
or  equivalently  causes  the  strand  in  the  first  index  location  to  pass  in  front  of  the  strand  in 
the  second  index  location.  The  second  generator  causes  the  black  or  second  index  strand  to 
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Figure  2:  (a)  Three  sample  trajectories  shown  in  the  physical  plane.  An  “x”  represents 
the  initial  condition  of  the  trajectory  and  a  dot  represents  the  current  position,  (b)  The 
projection  of  the  trajectories  onto  the  x-axis  as  a  function  of  time.  When  a  trajectory  passes 
behind  another  trajectory  a  gap  is  placed  in  the  trajectory,  (c)  The  transformation  of  the 
braid  in  (b)  to  a  plot  of  their  indexes  as  a  function  of  time. 


pass  in  front  of  the  red  or  third  index  strand.  Note  that  the  black  strand  has  moved  from 
the  first  to  the  second  to  the  third  index  during  the  application  of  these  generators. 

Based  on  the  description  of  what  the  generator  does  it  is  clear  that  there  are  two 
defining  characteristics:  first,  what  strands  are  involved  in  the  crossing  and  second,  which 
strand  passes  in  front.  These  two  components  can  be  used  to  give  a  numerical  value  for  a 
specific  generator.  The  strands  involved  in  the  crossing  are  indicated  by  the  magnitude  of 
the  generator,  and  the  direction  of  crossing  is  indicated  by  the  generator  being  positive  or 
negative.  Since  the  two  strands  which  cross  must  be  right  next  to  each  other,  the  magnitude 
of  the  generator  is  equal  to  the  lower  index  of  the  strands  involved  in  the  crossing. 

For  example,  the  braid  shown  in  Figure  2c  has  two  generators  acting  on  three  strands. 
The  first  generator  crosses  the  lower  index  strand,  black,  behind  the  higher  index  strand. 
This  direction  of  crossing  is  defined  as  positive.  The  two  strands  involved  in  the  first  crossing 
are  in  positions  one  and  two.  This  indicates  that  the  first  generator  is  a \  =  1.  The  same 
procedure  can  be  repeated  for  the  second  crossing  which  takes  the  strand  in  index  two  and 
crosses  it  in  front  of  the  strand  in  index  three.  Thus,  the  second  generator  is  02  =  —2.  In 
this  case,  the  generator  sequence  a  =  [1  —  2]  fully  defines  the  three  strand  braid.  It  should 
be  stressed  that  the  actual  strand  involved  in  a  crossing  does  not  mater  just  the  indexes 
where  crossing  is  taking  place. 

2.2  Loops 

The  next  tool  which  will  be  used  throughout  this  report  is  the  concept  of  a  loop.  A  sample 
loop  is  drawn  in  Figure  3a.  A  loop  is  a  boundary  of  a  region  that  can  pass  around  or 
encapsulate  punctures  and  provides  a  way  for  us  to  divide  up  the  fluid  system  into  regions. 
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Figure  3:  (a.)  Sample  loop  drawn  for  a  four  puncture  system.  The  fi' s  and  ids  are  presented 
in  the  plot  and  the  corresponding  Dynnikov  coordinates  are  a  =  [1  —  1]  and  b  =  [ 0  1].  (b.) 
How  an  initial  loop  (shown  at  the  bottom)  gets  modified  by  the  generators  for  the  system 
shown  in  Figure  2.  The  number  of  intersection  (from  bottom  to  top)  are  2,3,  and  5. 


If  a  mixing  transport  boundary  exists  and  contains  punctures,  it  should  be  possible  to  find 
a  loop  that  represents  this  boundary.  A  loop  representing  a  mixing  transport  boundary 
must  have  a  length  that  does  not  grow  with  time.  Clearly  if  a  loop  represents  the  transport 
boundary  it  is  important  to  understand  some  characteristics  of  loops. 

The  first  important  concept  is  that  a  loop  can  be  uniquely  defined  by  a  coordinate 
system.  The  coordinate  system  uses  information  on  the  number  of  times  the  loop  passes 
above,  below,  or  between  punctures.  For  example,  the  number  of  times  the  loop  passes 
directly  above  the  second  puncture  is  represented  as  /ii  and  below  the  second  puncture  is 
fj>2  •  Similarly  the  number  of  times  the  loop  passes  above  and  below  the  third  puncture  are 
/X3  and  /U4  respectively.  Additionally  the  number  of  times  the  loop  passes  the  midpoint 
between  punctures  is  also  required  for  the  coordinate  system  representation.  To  visualize 
this,  a  vertical  line  at  the  midpoint  between  punctures  is  drawn  in  Figure  3a  and  the 
intersection  with  the  loop  can  be  counted.  The  three  midpoint  crossings  from  left  to  right 
are  v\ ,  ^2,  and  1/3. 

The  information  about  number  of  passings  can  be  condensed  into  a  minimal  coordinate 
system  called  the  Dynnikov  coordinates  [4].  Taking  the  information  about  number  of  passes, 
the  Dynnikov  coordinates  transform  the  /d s  and  ids  by  the  following  relationship 


CLi  — 


d2 i  ~  ^2i-l 
2 


bi  = 


Vi  ~  Vj+1 
2 


These  coordinates  are  essentially  a  comparison  of  the  number  of  times  a  loop  passes  above 
or  below  the  i  +  1  puncture,  a*,  and  the  difference  in  the  number  of  times  the  loop  passes 
to  left  and  to  the  right  of  the  i  +  1  puncture,  bi.  The  Dynnikov  coordinate  representation 


150 


for  a  given  loop  is  u  =  (ai,  ...,an_2,&i,  ...,6„_i).  This  coordinate  system  uniquely  defines 
any  loop  by  a  sequence  of  integers,  and  any  even  number  of  integers  can  be  represented  as 
a  loop. 

With  the  coordinate  system  in  place  it  is  possible  to  concisely  represent  any  loop  and 
the  next  step  is  to  understand  how  the  loop  evolves  as  the  generators  cause  the  punctures 
to  change  positions.  As  the  generators  are  applied  to  the  strands,  the  loop  surrounding  the 
strands  is  also  moved.  An  example  application  of  generators  to  a  loop  is  shown  in  Figure 
3b.  The  initial  loop  shown  at  the  bottom  of  the  figure  is  first  modified  by  the  generator 
<7i  =  1  then  by  a 2  =  —  2  in  the  same  way  the  strands  from  Figure  2c  are  intertwined. 

One  of  the  strengths  of  braid  theory  is  that  there  is  a  set  of  rules  for  updating  the 
coordinates  presented  in  [10].  These  rules  allow  for  the  rapid  calculates  of  how  the  loops 
changed  position  due  to  a  generator  and  prevent  the  need  for  a  time  intensive  advection 
calculation  which  would  require  a  velocity  field.  The  simple  set  of  update  rules  also  allows 
for  the  analysis  of  a  large  number  of  loops  simultaneously  which  aids  in  the  search  for  the 
mixing  transport  boundaries. 

Since  we  have  defined  mixing  transport  boundaries  as  boundaries  with  approximately 
time  independent  length,  we  need  to  establish  a  way  to  measure  loop  lengths.  Moussafir 
demonstrated  that  the  length  of  a  loop  is  proportional  to  the  number  of  times  the  loop 
crosses  a  line  connecting  all  the  punctures.  Figure  3b  depicts  the  line  connecting  the  punc¬ 
tures,  and  an  intersection  occurs  every  time  the  loop  crosses  this  line. 

Fortunately,  an  intersection  calculation  exists  which  produces  a  value  proportional  to 
the  loop  length.  The  intersection  for  a  given  loop,  L( u)  [10],  is  calculated  by 

n— 3  7i—l 

L( u)  =  |ai|  +  |an_2|  +  ^2  |ai+i  -  a*|  +  ^  |6j|  (1) 

i=  1  i=0 

where 


bo 


7  —  1 


max  \aA  +  max(bi,0) +y  bj 

i<i<n- 2  1  J 

3= 1 


n— 2 

and  bn-i  =  ~bp  -  ^  bj 

7=1 


This  calculation  will  be  used  to  determine  if  a  loop  is  growing  with  time,  and  thus  not 
a  mixing  transport  boundary,  or  if  the  length  stays  of  the  same  approximately  the  same 
length  throughout. 

Finally,  the  set  of  punctures  inside  by  a  non-growing  loop  will  be  referred  to  as  a  structure 
or  puncture  structure  set.  A  puncture  is  considered  to  be  inside  a  loop  if  there  is  no  set  of 
generators  that  will  result  in  the  loop  not  being  both  above  or  below  the  puncture.  The 
punctures  (trajectories)  inside  the  loop  (transport  boundary)  will  intermix  but  not  mix  with 
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the  punctures  outside  of  the  loop. 


2.3  Summary  of  the  application  of  braid  theory 

With  the  tools  and  definitions  developed  in  the  previous  sections,  it  is  now  possible  to  give  an 
overview  of  the  application  of  braid  theory  to  the  problem  of  finding  two  dimensional  mixing 
transport  boundaries.  The  trajectories  are  converted  to  strands  and  a  generator  sequence 
is  calculated  from  the  trajectory  crossings.  A  mixing  transport  boundary  is  the  equivalent 
to  a  loop,  which  does  not  show  long  term  growth,  surrounding  a  subset  of  punctures.  The 
goal  of  this  report  is  to  demonstrate  that  these  loops  do  exist  and  can  be  found. 

The  search  for  structures  and  non-growing  loops  will  be  done  in  two  ways.  The  first 
method  is  a  methodical  search  for  the  non-growing  loops  by  testing  Dynnikov  coordinates 
in  a  sequential  manner  which  is  presented  in  Section  3.  This  method  will  be  improved  upon 
by  a  refined  searching  technique  described  in  Section  4.  For  both  search  methods,  a  system, 
see  Section  3.3,  is  analyzed  containing  a  mixing  transport  boundary. 

One  assumption  made  throughout  this  paper,  is  that  the  mixing  transport  boundaries 
simply  surround  a  region  of  the  fluid.  It  seems  unlikely  that  a  boundary  which  weaves  in 
and  out  of  other  punctures  would  not  grow  with  time.  Based  on  this,  it  is  assumed  that 
the  Dynnikov  coordinates  defining  the  non-growing  loops  will  have  small  magnitudes  for  its 
integer  coordinates,  because  as  the  coordinate  magnitudes  grow  so  does  the  complexity  of 
the  loop. 

3  Proof  of  concept 

The  basis  of  the  proof  of  concept  method  is  that  if  a  simple  non-growing  loop,  or  mixing 
transport  boundary,  exists  it  will  be  found  by  analyzing  the  set  of  all  simple  loops.  Here 
we  present  what  is  classified  as  a  simple  loop,  an  outline  of  the  code  which  calculates  and 
analyzes  the  loops,  and  a  sample  system  and  results  of  the  proof  of  concept  code. 

3.1  Simple  loops 

The  Dynnikov  coordinates  provide  a  compact  way  of  representing  loops,  and  any  list  of 
2n  —  4  integers  can  create  a  loop  for  an  n  puncture  system.  Additionally,  the  loop  grows 
more  and  more  complex  as  the  coordinate  values  grow  larger.  Based  on  the  assumption 
that  the  non- growing  loops  are  simple  loops,  the  proof  of  concept  code  will  only  search 
through  loops  where  each  of  the  initial  coordinates  can  have  a  value  between  —1  and  1. 
Four  example  loops  are  presented  in  Figure  4  demonstrating  the  range  in  complexity  which 
is  encompassed  by  this  range  of  Dynnikov  coordinates. 
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I 


u  =  [1  -1  0-1  1  0  0-1  1  0] 


U  =  [0  00000000  1] 


U  =  [0  00000001  0] 


\ 

IIIM 

\ 


u  =  [1  0  1  0  1  10-1  1-1] 


Figure  4:  Four  loops  which  fall  into  the  category  of  simple. 

3.2  Algorithm  outline 

In  order  to  find  the  non-growing  loops,  an  algorithm  is  developed  using  Matlab.  The  ob¬ 
jective  of  this  algorithm  is  to  take  in  a  set  of  trajectories  and  output  the  non-growing 
loop  entangling  the  punctures.  The  first  step  is  to  calculate  the  generator  sequence  which 
defines  the  braid  of  the  trajectories.  Next,  for  each  Dynnikov  coordinates  which  has  coor¬ 
dinate  values  between  —1  and  1  inclusively,  the  entire  generator  sequence  is  applied  and 
the  resulting  loops  have  their  intersections  calculated.  The  loops  with  the  smallest  final 
number  of  intersections  are  designated  as  non-growing  loops. 

3.2.1  Calculation  of  Generators 

The  generator  sequence  for  a  given  set  of  trajectories  is  calculated  by  a  procedure  called 
the  color  braiding  method.  Pairs  of  strands  are  analyzed  to  determine  when  the  strands 
cross.  This  method  is  called  “colored”  because  each  strand  is  distinguishable  throughout 
searching  through  the  pair  and  it  matters  which  strand  starts  in  the  lower  index  position. 
This  differs  from  the  final  generator  sequence  where  only  the  indexes  of  the  crossing  are 
considered  and  not  which  strands  are  involved  in  the  crossing. 

The  color  braiding  search  starts  first  by  ordering  the  braids  based  on  the  initial  value 
of  their  x— coordinate.  Then  the  code  loops  through  each  pair  of  strands  where  there 
are  (n2  —  n)/ 2  pairs  for  systems  with  n  strands.  For  the  pair  of  strands,  three  pieces  of 
information  are  saved  for  each  crossing:  the  time  of  crossing,  the  direction  (either  clockwise 
or  counter-clockwise  based  on  the  y  value  at  the  time  of  crossing),  and  the  order  of  the 
strands  before  crossing.  After  all  pairs  of  strands  have  been  analyzed,  the  crossings  are 
collected  and  ordered  by  time  of  crossing. 

The  final  step  is  to  convert  the  list  of  crossings  to  a  generator  sequence.  In  the  cur- 
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t crossing 

Direction 

Left  Strand 

Right  Strand  I  =  [12  3], 

1.245 

1 

2 

3  I  =  [  13  2],  £7i  =  2 

2.142 

1 

1 

3  I  =  [3  12],  a2  =  1 

3.145 

-1 

1 

2  /  =  [3  2  1],  erg  =  -2 

Figure  5:  A  sample  system  featuring  three  strands  and  three  crossings.  The  left  table 
demonstrates  the  information  from  the  comparison  between  pairs  of  strands.  The  right  side 
shows  the  index  vector  as  its  being  changed  by  the  crossings  and  the  resulting  generator 
sequence. 

rent  state,  information  on  which  strands  cross  is  available  but  the  generator  sequence  is 
dependent  on  the  index  positions  involved  in  crossings.  This  conversion  is  accomplished 
by  creating  a  vector  naming  each  strand  after  the  strands  initial  index  and  applying  each 
crossing  chronologically.  For  each  crossing,  the  two  strand  names  are  found  and  their  po¬ 
sitions  in  the  vector  are  switched.  The  smaller  index  of  the  strand  locations  becomes  the 
generator  magnitude  and  the  direction  determined  by  color  braiding  is  applied  to  calculate 
the  generator.  The  calculated  generator  sequence  is  then  saved  with  the  corresponding 
times.  A  sample  system  is  shown  in  Figure  5  and  outlines  the  conversion  from  an  ordered 
list  of  color- braiding  information  to  a  generator  sequence.. 

3.2.2  Calculating  loops,  applying  generators,  and  intersection  calculation 

The  next  step  in  the  algorithm  is  to  cycle  through  all  possible  simple  loops  and  apply  the 
generator  sequence.  As  mentioned  in  Section  2.2,  an  n  puncture  system  is  represented  by 
2n  —  4  coordinates  and  only  values  of  —1  ,0,  and  1  will  be  considered  for  each  coordinate. 
This  results  in  3~(2 n  —  4)  —  1  possible  loops  which  must  be  checked  (Note:  the  loop  with 
all  coordinates  equal  to  zero  is  not  considered  a  loop). 

A  simple  methodical  method  for  representing  all  these  loops  is  to  create  a  list  from  1  to 
3~(2n  —  4)  and  convert  the  list  to  their  respective  values  in  base  3.  The  resulting  numbers 
are  then  converted  to  a  string  of  length  2n  —  4  where  the  necessary  number  of  zeros  have 
been  appended  to  the  left  side  of  the  string.  Finally,  each  value  in  the  string  is  reduced  by 
one  producing  the  Dynnikov  coordinates.  With  each  loop  the  generator  sequence  is  applied 
by  utilizing  the  transformation  rules  and  a  final  loop  is  ready  to  be  measured. 

The  last  step  of  the  algorithm  is  to  take  each  final  loop  and  calculate  the  number  of 
intersections.  This  is  a  simple  calculation  based  on  Equation  1.  Given  that  the  number  of 
intersections  is  proportional  to  its  length,  a  criterion  is  set  by  the  user  which  will  define  a 
loop  as  growing  or  not  growing.  If  the  number  of  intersections  falls  below  this  threshold  it 
represents  an  approximation  for  the  location  of  a  mixing  transport  boundary  in  the  fluid. 
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3.3  Modified  Duffing  Oscillator  system 

In  order  to  test  this  algorithm,  it  is  necessary  to  develop  a  system  where  there  is  a  mixing 
transport  boundary.  This  system’s  velocity  field  will  be  used  to  calculate  trajectories  for 
analysis.  While  this  would  not  be  the  target  application  of  the  algorithm,  having  a  test 
case  where  it  is  possible  to  determine  a  priori  which  punctures  make  up  a  structure. 

The  system  selected  for  this  test  is  a  modification  of  the  Duffing  Oscillator.  One  change 
is  made  in  order  to  create  two  mixing  regions  and  the  other  change  adds  solid  body  rotation. 
The  base  system  without  rotation,  x  =  /(x)  is  defined  as 

x  =  y  +  a  cos  (cut) 
y  =  x(l  —  x 2)  +  7  cos(c vt)  —  5y. 


where  a  =  .1,  7  =  .14,  5  =  .08,  and  u  =  1.  To  add  rotation  to  the  system,  the  following 
transformation  is  made 


x  = 

x  = 


[i?(t)]x 


x  +  [i?(t)]x 


X  = 


[fi(<)]-1x  +  [/i(()]/([/;(*)]-1i) 


where 

rR(t\i  =  cos(Df)  sin(fit) 

—  sin(Df)  cos  (fit) 

The  key  feature  of  this  system  is  that  there  are  two  mixing  regions.  With  the  velocity 
field,  it  is  possible  to  find  these  regions  at  the  initial  time,  and  a  plot  of  the  two  regions 
is  presented  in  Figure  6.  This  plot  indicates  that  all  points  initially  in  the  gray  region 
should  make  up  a  structure,  and  all  punctures  initially  in  the  white  region  should  make  up 
a  second  structure.  Each  of  these  structures  should  have  a  non-growing  loop  surrounding 
the  punctures  in  the  given  structure. 

3.4  Results  of  basic  algorithm 

To  test  the  proof  of  concept  algorithm  a  variety  of  sets  of  initial  conditions  are  tested. 
A  single  representative  example  is  presented  here.  This  system  consists  of  seven  initial 
conditions  which  are  shown  in  Figure  6.  Based  on  that  figure,  it  is  expected  that  four  of  the 
punctures  should  make  up  one  structure  and  the  other  three  punctures  will  form  a  second 
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Figure  6:  Mixing  regions  for  the  modified  Duffing  oscillator  system.  The  white  region 
collapses  to  a  chaotic  mixing  region  at  the  center.  The  gray  region  approaches  a  limit 
cycle  around  the  white  region.  The  black  x’s  are  the  initial  conditions  for  the  trajectories 
discussed  in  Section  3.4. 


Figure  7:  (a)  Semi-log  plot  of  the  intersections  as  a  function  of  time  for  all  possible  loops 
with  Dynnikov  coordinate  values  ranging  from  -1  to  1.  (b)  Plot  of  the  intersections  as  a 
function  of  time  for  loops  where  growth  appears  to  be  non-exponential. 


structure.  With  these  trajectories,  the  corresponding  generators  are  calculated  and  the  310 
loops  are  manipulated  via  the  transformation  rules.  The  lengths  of  the  resulting  loops  are 
then  calculated. 

For  a  representative  set  of  loops  in  the  defined  range,  the  log  of  the  intersections  as  a 
function  of  time  are  plotted  in  Figure  7a.  This  plot  indicates  that  there  are  two  regimes 
of  growth:  loops  growing  exponentially  (linearly  increasing  lines  in  the  plot)  and  loops 
that  show  non-exponential  growth  (lines  approximately  flat  in  the  plot).  The  loops  growing 
exponentially  will  not  correspond  to  loops  surrounding  a  structure,  so  they  can  be  neglected. 
It  should  be  noted  though  that  they  all  share  the  quality  that  the  loops  enclose  a  subset  of 
the  four  punctures  which  mix  chaotically  in  the  center  but  never  all  four. 

A  closer  view  of  the  non-exponentially  growing  intersections  are  shown  in  7b.  This  plot 
is  not  on  a  log  scale  and  shows  that  some  of  the  loops  grow  approximately  linearly  while 
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(a) 


Figure  8:  Simplified  loop  diagrams  of  the  loops  which  do  not  show  steady  state  growth,  (a) 
Loop  containing  the  orbiting  trajectories  (black  punctures),  (b)  Loop  containing  the  inner 
trajectories  (red  punctures),  (c)  Double  loop  featuring  both  structure  loops. 

others  don’t  grow  at  all.  The  loops  which  grow  linearly  contain  all  four  trajectories  in  the 
chaotic  region  and  a  subset  of  the  limit  cycle  region  trajectories.  The  loops  which  don’t 
show  growth,  presented  in  Figure  8,  correspond  to  mixing  transport  boundaries.  Changes 
in  the  loop  do  occur  throughout  the  mixing  due  to  the  moving  of  the  particles,  but  there  is 
no  steady  state  growth  of  the  initial  loop. 

The  expected  transport  boundaries  were  found  indicating  that  the  proof  of  concept 
successfully  applied  the  tools  from  braid  theory  to  the  two  dimensional  trajectories.  One 
drawback  from  the  proof  of  concept  method  is  that  it  is  limited  in  application  due  to 
the  run  times.  For  systems  of  eight  or  fewer  punctures,  it  takes  seconds  to  find  the  non¬ 
growing  loops,  but  for  each  trajectory  added  after  eight,  the  run  time  increases  by  almost 
an  order  of  magnitude.  This  is  due  to  the  bottleneck  of  the  code  being  the  application  of 
generators  to  loops.  This  process  is  repeated  for  each  loop  and  the  number  of  loops  searched 
is  3~(2 n  —  4)  ~  9n.  This  fact  renders  the  method  useless  for  systems  with  more  than  eleven 
punctures. 

4  Refined  loop  finding  algorithm 

While  the  methodical  search  through  all  simple  loops  produces  the  non-growing  loops,  it  is 
not  a  realistic  options  for  systems  where  there  are  thirteen  or  more  punctures  (run  times 
would  be  on  the  order  of  four  days).  The  target  application  for  this  method  is  systems  with 
a  large  number  of  tracers  which  mix  well  so  having  a  capacity  for  analysis  of  only  thirteen 
particles  is  not  sufficient.  For  braid  theory  to  be  a  reasonable  method  for  finding  mixing 
transport  boundaries,  it  is  necessary  to  develop  a  code  where  the  time  to  find  non-growing 
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loops  is  not  t  ~  0(9"). 

One  major  drawback  of  the  previous  method  is  that  no  information  is  gained  from  a  loop 
which  is  growing,  so  every  loop  has  to  be  tried.  The  refined  algorithm  tries  a  specific  set  of 
loops  and  analyzes  their  state  after  the  generators  have  been  applied.  The  loops  analyzed 
connect  two  punctures.  By  letting  these  loops  evolve  their  final  state  gives  an  indication 
of  which  punctures  become  entangled  by  the  loop.  Now  links  can  be  drawn  between  which 
punctures  entangle  which  other  punctures.  This  is  the  basis  for  the  refined  algorithm. 

The  objective  of  the  refined  algorithm  is  still  to  take  in  a  set  of  trajectories  and  output 
the  punctures  within  a  structure  and  the  non-growing  loop  which  surrounds  the  loop.  The 
outline  for  this  algorithm  is  still  to  first  calculate  the  generator  sequence  which  defines  the 
braid.  This  generator  sequence  will  be  applied  to  a  set  of  Dynnikov  coordinates  which 
connect  pairs  of  punctures,  described  in  Section  4.1.  The  resulting  loops  are  then  analyzed 
to  determine  which  punctures  are  entangled  by  the  loop,  Section  4.2.  With  the  information 
on  entangled  sets,  a  code  determines  which  punctures  form  a  structure,  Section  4.3.  Finally, 
the  punctures  which  make  up  a  structure  are  surrounded  by  a  loop  which  does  not  grow, 
Section  4.4.  After  the  new  algorithm  has  been  described,  its  application  to  the  modified 
Duffing  oscillator  will  be  presented. 

4.1  Pair- loops 

As  mentioned  in  the  description  of  the  algorithm,  the  loops  of  initial  interest  are  ones  which 
connect  a  pair  of  punctures.  While  there  are  an  infinite  number  of  ways  for  a  loop  to 
connect  punctures,  the  main  concern  here  is  how  simple  loops  entangle  the  other  punctures. 
Given  this  consideration,  punctures  that  neighbor  each  other  will  be  connected  like  loop 
(1,2)  in  Figure  9a.  For  punctures  that  are  not  neighbors,  two  loops  are  created  where  one 
loop  passes  above  intermediate  punctures,  loops  (1,3)  and  (1,4)  in  Figure  9a,  and  the  other 
passes  below,  loop  (3,1)  or  (4,1)  in  Figure  9a. 

For  a  system  of  n  particles  there  are  n  —  1  loops  connecting  neighbors  and  (n  —  2)(n  —  1) 
loops  connecting  above  and  below  resulting  in  a  total  of  (n  —  l)2  loops  to  analyze.  In  the 
last  method,  the  bottle  neck  was  the  application  of  the  generators  to  the  3~(2 n  —  4)  —  1 
loops  causing  the  run  time  to  be  t  ~  0(9n).  Now  if  the  bottle  neck  of  the  algorithm  is  still 
the  application  of  the  generators,  then  at  least  the  run  time  has  become  t  ~  0(n2)  which 
grows  substantially  slower  particularly  for  large  n.  Figure  9a  is  an  example  for  the  initial 
loops  which  are  considered  for  any  five  puncture  system. 
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Figure  9:  (a.)  The  initial  loops  for  a  five  puncture  system.  The  notation  for  each  initial 
loop  is  (a,  b)  where  the  loop  goes  from  a  to  b  in  a  clockwise  direction,  (b.)  The  loops 
which  result  from  applying  a  generator  sequence.  Only  loops  (1,3),  (4,2),  (5,2),  and  (4,5) 
do  not  entangle  all  punctures  after  generator  application.  This  group  of  trajectories  form 
two  puncture  structure  sets  (1,3)  and  (2,4,5) 
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4.2  Calculation  of  entangled  punctures  sets 

After  the  generator  sequence  has  been  applied  to  the  pair-loops,  the  code  needs  to  determine 
which  punctures  are  entangled  by  the  loops.  We  will  describe  a  puncture  as  disentangled 
with  a  loop  if  the  loop  does  not  pass  above  and  below  the  puncture.  To  determine  this 
the  final  loop  Dynnikov  coordinates  are  converted  to  the  PMN-coordinate  system.  The 
PMN-coordinate  system  calculates  the  number  of  times  the  loop  passes  above  (M,;)  and 
below  (TVj)  the  ith  puncture  as  well  as  half  the  number  of  intersections  of  the  loop  with  the 
mid-line  between  punctures  (Pt  =  z/*/ 2).  For  example  the  loop  presented  in  Figure  3a  has 
M  =  [2  1  3  1],  N  =  [2  3  1  1],  and  P=[  2  2  1], 

With  this  new  coordinate  representation,  it  is  straightforward  to  determine  if  a  puncture 
is  entangled  or  not.  For  the  ith  puncture  to  be  disentangled  either  M,  or  Nt  is  zero  for  the 
loop  after  applying  the  generator  sequence.  The  code  searches  for  all  entangled  punctures 
for  a  given  loop  and  then  saves  entangled  set  and  the  corresponding  initial  loop.  This 
process  is  repeated  for  each  pair-loop.  After  all  loops  have  been  processed,  the  entangled 
sets  are  condensed  to  remove  any  repeated  sets. 

For  the  example  presented  in  Figure  9,  the  generators  have  been  applied  to  the  loops  in 
Figure  9a  and  they  produce  the  loops  in  Figure  9b.  Of  the  sixteen  loops  only  four  do  not 
entangle  all  the  punctures.  The  code  only  considers  entangled  sets  which  are  a  subset  of 
the  full  set  of  punctures.  The  loop  (1,3)  entangles  only  punctures  1  and  3.  The  loops  (4,2), 
(5,2),  and  (4,5)  entangle  only  the  punctures  2,  4,  and  5.  This  means  the  example  results  in 
two  entangled  sets  which  also  happen  to  be  the  structures  for  this  system.  It  is  possible  to 
find  a  non-growing  loop  around  punctures  1  and  3  and  one  around  2, 4, and  5. 

4.3  Determining  punctures  in  a  structure 

While  the  example  presented  in  Sections  4.1  and  4.2  produce  the  structure  sets  directly 
from  the  entangled  puncture  sets,  this  is  not  always  the  case.  For  a  pair-loop,  the  two 
punctures  are  either  within  the  same  structure  or  not.  If  a  loop  connects  points  within  a 
structure,  the  loop  can  either  entangle  a  subset  of  punctures  within  the  same  structure,  all 
punctures  within  the  structure,  or  all  punctures  within  the  structure  as  well  as  punctures 
from  another  structure.  In  the  simple  example,  loops  connecting  points  within  a  structure 
always  only  entangled  all  the  points  within  the  same  structure.  Alternatively,  if  a  loop 
connects  punctures  from  different  structures  they  will  entangle  all  the  punctures  from  the 
two  structures  and  possibly  other  structures  as  well.  The  code  attempts  to  take  these 
entangled  sets  and  decipher  which  punctures  form  a  structure. 

To  create  the  structures,  the  code  runs  through  the  entangled  sets  and  allocates  a 
structure  index  to  the  set.  The  entangled  sets  are  read  in  from  largest  to  smallest,  so  the 
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first  and  largest  set  is  labeled  as  structure  1  and  then  the  next  entangled  set  is  analyzed. 

If  the  second  set  intersects  a  structure,  then  all  points  in  the  second  set  are  labeled  with 
the  same  structure  index.  However,  if  the  second  set  is  disjoint  from  the  structures,  it  is 
given  a  new  structure  index.  This  process  is  repeated  for  each  of  the  remaining  entangled 
puncture  sets.  It  is  possible  for  an  entangled  set  to  intersect  multiple  structures  in  which 
case  the  structures  as  well  as  the  entangled  set  will  all  merge  into  one  structure. 

At  the  end  of  this  process,  there  are  two  options  either  a  set  of  disjoint  structures  have 
been  created  or  a  single  structure  containing  all  punctures.  If  the  latter  option  is  produced, 
the  code  reruns  but  instead  starts  with  the  next  smallest  entangled  set  and  repeats  the 
process  till  it  creates  multiple  structures.  Based  on  the  method  of  finding  structures,  it  is 
very  easy  for  multiple  structures  to  get  combined  into  one  larger  structure.  To  combat  this 
issue  the  code  has  the  capability  to  rerun  the  full  algorithm  on  a  structure  to  determine 
if  it  can  be  broken  down  into  smaller  structures.  If  the  code  runs  through  all  the  possible 
starting  positions,  and  it  can  still  only  find  a  structure  containing  all  punctures,  it  concludes 
that  there  are  no  structures  present  and  the  absence  of  non- growing  loops. 

For  clarity,  the  following  is  an  example  where  there  are  three  known  structures  {1,2},  {3, 4} 
and  {5,6}.  The  entangled  sets  are 

{1,2, 3, 4}, 

{3, 4,  5, 6}, 

{1,2}, 

{3, 4}, 

{5,6}. 

Starting  with  the  first  entangled  set  leads  to  the  formation  of  one  structure  after  trying 
four  sets.  Since  only  one  structures  is  found,  the  code  restarts  from  the  second  entangled 
set.  This  results  in  two  structures  {1,2}  and  {3, 4,  5,  6}.  The  first  structure  is  one  of 
known  structure  while  the  second  combines  two  of  the  known  structures.  This  fact  would 
be  resolved  after  rerunning  the  {3, 4,  5,  6}  trajectories  through  the  full  algorithm. 

4.4  Calculating  Dynnikov  coordinates  for  a  structure 

The  final  task  for  the  algorithm  is  to  find  a  loop  for  each  structure  which  surrounds  the 
punctures  and  does  not  grow.  A  loop  connecting  the  structure’s  puncture  set  can  be  viewed 
as  the  sum  of  the  loops  connecting  punctures  in  a  structure.  An  example  is  presented  in 
Figure  10  where  three  loops  connect  the  punctures  in  a  structure  then  are  merged  to  form 
one  loop.  The  task  of  this  portion  of  the  algorithm  is  to  determine  how  to  connect  the 
punctures  in  a  structure  then  merge  these  loops  together. 

If  a  loop  surrounding  a  structure  does  not  grow  and  it  is  composed  of  links  between 
punctures,  then  the  links  necessarily  should  not  grow  due  to  being  mixed  by  punctures 
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Figure  10:  Punctures  1,2,4,  and  6  are  in  a  structure,  (a.)  The  loops  shown  here  connect 
the  punctures.  Blue  connects  1  to  2,  red  connects  2  to  6,  and  green  connects  6  to  4.  (b.) 
The  connecting  loops  can  be  merged  to  form  a  single  loop  around  the  punctures. 

outside  the  structure.  This  means  that  if  all  punctures  from  a  structure  are  removed  except 
for  two  that  are  linked  then  the  loop  that  links  the  remaining  structure-punctures  should 
not  grow.  From  the  example  in  Figure  10,  if  punctures  4  and  6  were  removed,  the  loop 
connecting  punctures  1  and  2  should  not  grow  in  a  braid  of  strands  1,2,3,  and  5.  The  code 
takes  advantage  of  this  property,  in  order  to  find  the  links. 

This  portion  of  the  code  takes  in  the  punctures  that  are  in  a  structure  and  outputs 
the  loop  surrounding  the  punctures  which  does  not  grow.  Using  the  property  previously 
described  as  the  basis  for  this  method,  the  code  selects  a  pair  of  punctures  in  the  structure 
then  calculates  the  generator  sequence  for  a  braid  containing  only  these  two  punctures  and 
the  punctures  not  in  the  structure.  With  the  generator  sequence  simple  loops  connecting  the 
two  punctures  of  interest  are  created  and  the  generators  are  applied.  The  code  calculates 
the  number  of  intersections  and  determines  if  there  is  a  non-growing  loop.  Once  all  the 
punctures  are  linked  together,  the  loops  are  merged  and  the  final  loop  is  analyzed  to  confirm 
it  does  not  grow.  This  concludes  the  algorithms  task. 

4.5  Application  to  modified  Duffing  oscillator 

The  goal  of  the  refined  method  is  to  turn  the  use  of  the  braid  theory  into  a  potential 
method  for  finding  mixing  transport  boundaries.  The  run  times  from  the  proof  of  concept 
code  significantly  limit  that  approaches  applicability,  but  by  changing  the  number  of  loops 
searched  from  0(?r2),  there  is  potential  in  the  refined  method.  For  a  direct  comparison, 
a  set  of  five  n  puncture  systems  are  randomly  created  and  run  for  both  algorithms.  The 
results  of  these  times  are  presented  in  Table  1. 

The  run  time  results  demonstrate  the  drastic  advantages  that  the  refined  method  offers. 
While  systems  with  fewer  than  nine  punctures  actually  complete  their  runs  in  a  faster  time 
using  the  proof  of  concept  algorithm,  the  refined  algorithm  allows  the  braid  theory  approach 
to  be  applied  for  systems  with  much  larger  numbers  of  punctures.  While  the  time  growth 
rate  for  the  proof  of  concept  clearly  becomes  the  predicted  t  ~  0( 9n),  the  refined  algorithm 
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#  of  punctures 

5 

6 

7 

8 

9 

10 

11 

20 

Proof  of  Concept 

0.33 

0.46 

0.70 

5.98 

53.20 

462.18 

3445.39 

N/A 

Refined 

6.71 

9.46 

11.61 

12.34 

12.87 

15.71 

20.48 

127.86* 

Table  1:  Run  times  for  the  modified  Duffing  oscillator  using  the  proof  of  concept  and  refined 
algorithms.  *  The  run  with  20  generators  had  over  four  times  as  many  generators  which 
contributes  to  this  being  higher  than  the  trend  would  suggest. 

does  not  demonstrate  a  specific  trend  in  growth  time.  This  is  due  to  the  fact  that  the 
application  of  generators  is  not  the  bottle  neck  of  the  algorithm.  The  larges  amount  of  time 
is  spent  calculating  the  non-growing  loop  from  the  structures. 

5  Finding  non-growing  loops  in  a  rod  mixed  system 

The  final  step  in  this  report  is  to  apply  this  algorithm  to  an  actual  fluid  system.  The 
system  selected  for  a  first  application  test  is  the  mixing  performed  by  the  translating- 
rotating  mixing  system  presented  by  Finn  and  Cox  [6].  Their  mixing  mechanism  features  a 
circular  domain  of  fluid  and  a  circular  stirring  rod  located  somewhere  in  the  fluid.  The  rod 
has  the  capability  of  both  translation  and  rotation  about  its  center  and  the  outer  boundary 
of  the  domain  has  the  ability  to  rotate  about  its  axis.  The  authors  are  able  to  develop  an 
analytic  solution  for  the  fluid  motion  when  the  rod  is  moved  slowly  through  the  fluid. 

An  interesting  result  from  this  type  of  mixing  is  the  appearance  of  islands  of  poor  mixing 
when  the  circular  rod  traverses  a  periodic  trajectory  through  the  fluid.  These  islands  have 
a  transport  boundary  around  them  which  remains  the  same  order  of  size  and  mixing  inside 
and  outside  don’t  interact.  The  presence  of  these  islands  is  undesirable  because  the  purpose 
of  the  mixing  is  to  homogenize  the  entire  domain.  With  the  authors  analysis  it  is  possible 
to  make  accurate  estimations  on  the  size  and  locations  of  these  islands  by  calculating  an 
iterated  or  Poincare  map.  Our  objective  is  to  determine  if  the  refined  braid  theory  algorithm 
can  also  find  these  islands. 

A  sample  system  is  specified  by  setting  the  trajectory  of  the  rod,  presented  in  Figure 
11a,  and  a  set  of  initial  conditions  for  the  trajectories,  presented  in  Figure  lib.  A  set  of 
forty  initial  conditions  with  half  distributed  in  seven  islands  and  the  other  half  distributed 
throughout  the  well  mixed  chaotic  sea.  The  rods  position  is  also  taken  as  a  trajectory 
bringing  the  total  to  forty-one.  The  initial  conditions  are  tracked  for  five  periods  of  the 
mixing,  and  their  trajectories  produce  a  seven  thousand  generator  long  sequence. 

The  application  of  the  refined  algorithm  takes  approximately  five  and  a  half  minutes 
and  produces  the  results  roughly  drawn  in  Figure  12.  The  loops  plotted  are  drawn  by  hand 
but  accurately  represent  the  non-growing  loops.  An  algorithm  to  represent  the  non-growing 
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Figure  11:  (a.)  Mixing  geometry  for  sample  system,  (b.)  Iterated  map  demonstrating  the 
islands  and  the  initial  positions  of  the  forty  trajectories. 


Figure  12:  Mixing  rod  system  with  the  initial  conditions  and  the  non- growing  loops  drawn 
in. 
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loops  in  development. 

The  first  observation  is  that  not  all  of  the  islands  have  a  non-growing  loop  around  them. 
The  reason  for  this  is  that  there  are  two  islands  which  don’t  have  any  trajectories  in  them 
and  two  with  only  one  trajectory.  This  algorithm  relies  on  multiple  trajectories  being  inside 
a  mixing  transport  region  in  order  to  find  them.  If  there  are  no  trajectories,  then  the  code 
is  unaware  that  the  region  exists,  and  if  there  is  only  one  trajectory  then  the  code  does  not 
recognize  a  non-growing  loop  around  a  single  point. 

The  other  observation  is  that  some  of  the  loops  include  nearby  trajectories  in  the  chaotic 
sea.  The  largest  loop  and  the  loop  at  the  bottom  of  Figure  12  both  are  examples  of  non¬ 
growing  loops  beyond  the  boundaries  of  the  island.  The  justification  for  this  is  that  in  the 
period  of  time  analyzed,  none  of  the  punctures  in  the  chaotic  sea,  the  other  islands,  or  the 
rod  pass  between  the  islands  and  its  nearby  neighbors  which  end  up  in  the  structure.  If 
more  time  were  analyzed  or  a  larger  number  of  punctures,  the  probability  that  a  puncture 
would  pass  between  the  island  and  the  nearby  puncture  becomes  higher,  and  if  this  passing 
were  to  occur  the  code  would  find  a  loop  around  only  punctures  in  the  island. 

6  Conclusion 

This  report  has  presented  the  concept  of  using  braid  theory  to  find  mixing  transport  bound¬ 
aries  in  two  dimensional  systems  using  float  data.  Introductory  information  on  braid  theory 
is  provided  for  a  foundation  to  the  reader,  and  then  a  proof  of  concept  algorithm  is  devel¬ 
oped  and  demonstrated.  With  the  conclusion  that  the  proof  of  concept  algorithm  is  not 
applicable  to  systems  with  more  than  thirteen  trajectories,  a  refined  algorithm  is  presented 
which  uses  a  pair  wise  method  for  searching  for  non-growing  loops.  This  method  success¬ 
fully  reduced  the  computation  time  and  significantly  increased  the  number  of  trajectories 
which  can  be  analyzed.  This  algorithm  is  then  applied  to  the  real  fluid  system  of  a  viscous 
fluid  being  mixed  by  a  moving  rod. 

Some  observations  on  the  limitations  of  this  algorithm  are  presented  in  Section  5,  but 
there  are  more  issues  than  these  system  specific  considerations.  In  order  for  the  algorithm  to 
work  it  is  necessary  that  the  significant  mixing  is  occurring  in  the  system.  The  accuracy  of 
the  sets  of  punctures  forming  structures  is  directly  proportional  to  the  number  of  generators 
in  the  system. 

Another  potential  issue  is  that  the  float  trajectories  have  to  be  highly  accurate.  As 
previously  stated,  it  is  necessary  to  have  the  correct  generator  sequence  in  order  to  find 
loops  which  do  not  grow.  If  the  position  data  does  not  have  a  high  resolution,  it  is  possible 
a  crossing  will  be  missed  leading  to  inaccurate  structures  and  false  non-growing  loops.  While 
it  is  straightforward  to  refine  a  trajectory  from  a  velocity  field,  this  may  not  be  an  option 
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for  trajectory  data  taken  from  a  float  in  the  ocean. 

Even  though  these  issues  do  limit  the  application  of  the  refined  algorithm  and  braid 
theory  somewhat,  there  is  still  the  potential  to  find  structures  in  well  mixed  systems  with 
a  large  number  of  particles.  One  potential  area  of  application  is  the  search  for  clustering 
in  granular  flow  data.  There  is  a  need  to  be  able  to  find  clusters  of  particles  which  move 
together  and  mix  amongst  themselves  [2].  The  refined  algorithm  could  be  used  to  take 
trajectories  of  grains  and  find  these  clusters. 

In  addition  to  finding  applications  of  this  algorithm  there  are  a  number  of  directions  for 
future  work.  One  improvement  on  the  algorithm  would  be  to  add  the  ability  to  plot  the  non¬ 
growing  loops  on  the  data  as  the  system  evolves.  Current  attempts  to  plot  the  non- growing 
loops  has  been  unsuccessful.  It  would  also  be  valuable  to  determine  the  success  rates  of  the 
refined  code  for  various  crossings  per  trajectory  values.  This  would  give  a  sharper  range  of 
applicability  of  the  refined  algorithm. 
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1  Introduction 

Cave  Rings  are  cave  formations,  or  speleothems,  consisting  of  thin  circles  on  cave  floors, 
normally  found  surrounding  a  water  splash  point,  beneath  stalactites.  The  rings  range  in 
diameter  from  a  few  centimetres  to  roughly  two  metres,  and  the  width  and  height  of  the 
deposit  ranges  from  a  few  millimetres  to  a  few  centimetres.  Examples  from  caves  in  Italy 
can  be  seen  in  figure  1.  Elliptical  formations  have  also  been  observed  in  locations  with 
tilted  cave  floors.  The  rings  tend  to  occur  in  locations  where  the  floor  is  without  strong 
flows,  and  with  a  layer  of  dust  or  mud,  although  it  is  not  clear  whether  this  merely  aids 
their  discovery,  or  is  related  to  their  formation. 
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Figure  1:  Rings  photographed  in  a)  Grotta  Imbroglita  and  b)  Grotta  del  Secchio  in  Italy, 
from  [9] 
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There  are  currently  two  theories  within  the  caving  community  as  to  the  method  of  ring 
formation.  The  first,  designated  the  ‘splash  ring’  mechanism,  is  thought  to  be  the  method 
responsible  for  producing  the  larger  rings,  and  postulates  that  the  rings  are  formed  due  to 
the  accumulation  of  deposits  from  secondary  drops  being  ejected  from  the  primary  drop 
site  [5,  9].  This  requires  that  the  majority  of  the  secondary  drops  fall  to  the  same  radius. 
This  is  explained  in  [5]  as  being  due  to  the  majority  of  the  drops  ejecting  at  45°  and  then 
travelling  a  distance  determined  by  the  kinetic  energy  of  the  primary  drop,  dependent  on 
the  height  of  the  source  above  the  floor. 

The  second  theory,  designated  the  ‘fall  down  ring’  mechanism,  thought  to  be  responsible 
for  the  smaller  sized  rings,  was  first  proposed  in  [8]  and  theorises  that  the  rings  are  formed 
due  to  the  accumulation  of  secondary  drops  produced  from  the  spontaneous  breakup  of  the 
primary  drop  during  its  fall.  In  [9],  Nozzoli  et  al.  claim  to  prove  through  analysis  of  collected 
data  from  various  rings  that  this  is  in  fact  the  method  responsible  for  the  formation  of  all 
cave  rings.  They  suggest  that  all  of  the  primary  drops  break  up  at  142.7T7.2  cm  from  the 
stalactite  tip  and  then  fall  in  a  negative  arc  to  the  floor,  due  to  ‘other  forces’  than  gravity 
acting  on  the  small  secondary  drops. 


(a) 


Figure  2:  Depiction  of  a)  the  ‘splash  ring’  hypothesis  and  b)  the  ‘fall  down  ring’  hypothesis. 

The  aim  of  this  project  is  to  show,  through  analysis  and  experiments,  that  both  the 
‘splash  ring’  and  ‘fall  down  ring’  hypotheses  are  clearly  false.  We  also  test  a  new  hypothesis, 
that  cave  rings  are  formed  in  the  same  manner  as  coffee  rings [3],  that  is,  due  to  the  enhanced 
deposition  at  the  edges  of  sessile  drops. 


2  Experimental  Setup 

In  the  following  section,  the  experimental  setup  used  throughout  is  described.  A  depiction 
of  the  equipment  used  can  be  seen  in  figure  3.  Throughout  salt  water  was  used  as  an 
analogue  to  the  water  found  in  caves.  The  accuracy  of  this  replacement  is  discussed  in 
section  6 

A  black  aluminium  sheet,  122  cm  by  66  cm,  was  used  as  a  surface  which  could  be  easily 
heated  and  also  provide  contrast  to  the  white  salt  crystals.  This  was  raised  a  few  cm  off 
the  floor,  and  levelled  using  a  spirit  level,  although  the  sheet  was  not  flat  to  high  precision. 
Underneath  were  6  fluorescent  light  tubes,  centred  in  the  width  of  the  sheet  and  evenly 
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Figure  3:  Depiction  of  the  experimental  setup  used  throughout. 


distributed  along  the  length  of  the  sheet.  These  were  on  for  the  duration  of  all  experiments, 
and  left  on  for  several  hours  before  any  experiment  began,  to  provide  even  heating  to 
encourage  evaporation,  reducing  the  time  scales  of  the  experiments.  The  sheet  reached 
temperatures  of  25-27° C,  and  was  fairly  consistent  in  time,  though  not  evenly  distributed 
across  the  sheet. 

Above  this  sheet  was  a  system  of  rubber  tubing  connecting  a  variable  low  flow  pump 
with  a  source  of  salt  solution,  which  was  varied  in  concentration  for  different  experiments. 
The  solution  was  pumped  at  a  constant,  controllable  rate.  The  calibration  of  the  pump  was 
carried  out  by  collecting  the  liquid  pumped  over  a  period  of  time  for  a  specific  setting,  then 
using  a  combination  of  accurate  weight  and  density  measurements,  the  volume  pumped 
per  hour  was  found.  It  was  noted  that  the  rate  was  very  consistent  between  different 
measurements.  The  end  of  the  tube  was  suspended  over  the  table  using  system  of  pulleys 
and  string,  and  thus  the  height  above  the  table  could  be  adjusted  between  0  and  2m. 

A  Nikon  D200  model  digital  SLR  camera  was  set  up  on  a  tripod  next  to  the  table  to 
record  images  from  experiments.  This  was  set  on  a  timer  to  take  images  at  15  minute 
intervals  throughout  all  experiments.  The  images  were  then  processed  using  the  Image 
Processing  Toolbox  in  the  Matlab  2010a  software  program. 

3  Splash  Ring  Conjecture 

3.1  Literature 

The  ‘splash  ring’  conjecture  is  described  in  [5].  It  is  claimed  that  45°  is  the  most  probable 
angle  for  secondary  drops  to  be  ejected  at,  and  that  the  subsequent  radius  of  the  ring  is 
determined  by  the  amount  of  kinetic  energy  imparted  to  the  primary  drop  on  its  fall  from  the 
stalactite  tip.  This  has  several  logical  flaws,  the  most  obvious  being  the  assumption  that  the 
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secondary  drops  will  almost  always  gain  the  same  amount  of  kinetic  energy  from  the  splash, 
and  therefore  land  at  the  same  radius  from  the  splash  site  every  time.  The  work  of  [1]  shows 
that,  in  the  case  of  drops  hitting  a  very  shallow  liquid  layer,  as  is  likely  under  stalactites, 
there  is  a  range  of  emission  angles  and  a  wide  range  of  emission  velocities,  resulting  in 
a  range  of  splash  radii.  The  literature  surrounding  the  investigation  of  the  transport  of 
soil  and  plant  pathogens  via  raindrops  uses  a  ‘Fundamental  Splash  Distribution  Function’ 
which  assumes  that  the  mass  splashed  falls  exponentially  with  radius,  see  e.g.  [10],  and  is 
confirmed  experimentally. 

3.2  Experiment 

In  order  to  experimentally  disprove  the  splash  ring  conjecture,  the  setup  described  in  sec¬ 
tion  2  was  used,  with  the  source  raised  2nr  above  the  sheet,  and  a  15%  salt  solution  was 
dripped  at  a  rate  of  17.7  ±0.5  ml  per  hour.  The  resulting  pattern  of  salt  crystals  left  on  the 
sheet  after  19  hrs  15  min  can  be  seen  in  figure  4(a),  where  the  image  has  been  manipulated 
to  highlight  the  location  of  the  salt  crystals  as  follows,  using  the  Image  Processing  Toolbox 
in  Matlab  2010a: 

1.  Background  gradients  due  to  lighting  removed. 

2.  Image  converted  to  greyscale. 

3.  Exposure  adjusted  to  rescale  pixels  with  an  intensity  index  of  greater  than  0.5  to  an 
index  of  1. 

4.  Image  converted  to  black  and  white. 

5.  Image  taken  before  experiment  started  removed  to  eliminate  background  items. 

The  pattern  seen  in  the  bottom  right  of  the  photo  is  due  to  reflections  of  drops  from  a  piece 
of  equipment  located  next  to  the  sheet,  and  so  all  numerical  calculations  from  this  image 
use  the  upper  half  of  the  splash  pattern  only.  This  experiment  was  not  designed  to  imitate 
cave  conditions,  but  to  demonstrate  clearly  the  distribution  of  secondary  drops  produced 
by  a  single  drop. 

The  distribution  of  salt  coverage,  a  proxy  for  drop  mass,  versus  radius  can  be  seen  in 
figure  4(b),  as  calculated  from  the  image  in  figure  4(a).  The  salt  coverage  ratio  is  calculated 
from  the  ratio  of  white  to  black  pixels  within  a  given  radius  range.  The  radius  is  calculated 
by  finding  the  length  of  a  pixel  in  an  image  using  the  known  dimensions  of  the  sheet.  At 
small  radii,  there  is  no  salt  due  to  the  constant  recycling  of  the  water  beneath  the  source 
preventing  crystallisation  from  occurring.  From  roughly  4-10cm,  the  surface  is  completely 
covered  in  salt  crystals,  and  so  the  Salt  Coverage  Ratio  is  saturated  at  a  value  of  1.  The 
method  of  using  photographs  from  above  the  table  means  that  the  thickness  of  the  salt  at 
this  point  cannot  be  determined.  The  fact  that  some  radii  appear  to  have  coverage  ratios 
greater  than  one  is  due  to  the  limitations  of  the  technique  employed:  because  the  camera 
could  not  be  placed  directly  over  the  sheet  due  to  the  presence  of  the  source  equipment,  the 
pictures  are  taken  at  a  slight  angle,  and  so  each  pixel  is  not  the  same  length,  as  assumed  in 
calculations.  There  are  also  inherent  errors  in  taking  radial  calculations  from  gridded  data. 
However,  this  experiment  was  not  intended  to  provide  accurate  numerical  data,  merely 
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to  provide  a  qualitative  demonstration  of  the  inaccuracy  of  the  ‘splash  ring’  conjecture. 
The  clear  exponential  tail  to  the  distribution  at  radii  greater  than  10cm  shows  qualitative 
agreement  with  the  literature  as  discussed  in  section  3.1. 

To  determine  whether  the  presence  of  a  central  stalagmite  could  have  an  effect  on  the 
location  of  splashes,  the  same  experiment  was  repeated,  but  with  an  upturned  Pyrex  beaker 
placed  under  the  centre  of  the  source,  see  figure  5(a).  A  20%  salt  solution  was  dripped  at 
a  rate  of  10.1  ±  0.6  ml  per  hour,  and  the  resulting  pattern  left  after  21.5  hours  can  be  seen 
in  figure  5(b).  Clearly  there  is  no  ring  formed,  and  the  pattern  formed  is  similar  to  that 
formed  with  the  stalagmite  absent,  although,  as  the  flow  rate  was  lower,  the  pattern  is  not 
as  large. 


4  Fall  Down  Ring  Conjecture 

4.1  Literature 


The  ‘fall  down’  ring  conjecture  was  first  proposed  in  [8]  as  an  explanation  for  the  smaller 
sized  cave  rings  observed.  In  [9],  Nozzoli  et  al.  claim  to  prove  that  all  cave  rings  are 
produced  by  this  method.  They  collected  data  from  67  rings  in  5  caves  in  Central  Italy,  as 
well  as  creating  some  artificial  rings  by  placing  wooden  tablets  covered  in  soot  underneath 
several  stalactites,  over  rings.  The  data  they  collected  can  be  seen  in  figure  6,  where  an 
apparent  correlation  between  the  height  of  the  source  stalactite  and  the  ring  radius  can  be 
seen.  They  fit  this  data  to  a  quadratic  function,  and  then  conclude  that  all  the  rings  are 
produced  by  drops  that  break  up  at  142.7 ±  7.2  cm  below  the  source  stalactite,  then  execute 
a  negative  arc  to  the  ring  radius.  To  conclude,  because  a  curve  can  be  fitted  to  the  data, 
that  the  theory  must  therefore  be  correct,  is  optimistic  at  best.  There  could  be  many  ways 
to  explain  a  relationship  between  the  two  quantities  measured,  and  there  is  no  explanation 
as  to  the  mechanism  behind  the  proposed  breakup,  or  the  physically  unlikely  negative 
curvature,  which  is  put  down  to  ‘unknown’  forces.  There  is  also  an  apparent  contradiction 
in  their  explanation  -  if  the  drops  always  break  up,  then  what  forms  the  central  stalagmite 
visible  at  the  centre  of  all  the  rings?  The  artificial  rings,  which  they  claim  were  formed  on 
the  wooden  tablets  over  a  period  of  roughly  20  days,  would  perhaps  be  more  convincing  if 
the  paper  contained  any  pictures  of  these  rings  or  a  fuller  explanation  of  what  was  found. 
The  only  thing  to  be  concluded  from  this  work  is  that  any  successful  production  theory 
must  explain  this  apparent  correlation  between  source  height  and  ring  radius. 

A  review  of  the  literature  involving  drop  breakup  revealed  the  extensive  study  of  Viller- 
rnaux  and  Bossafll]  into  the  dynamics  of  single  drop  deformation  and  breakup,  with  the 
aim  of  explaining  the  distribution  of  raindrop  sizes  found  at  ground  level.  The  condition 
for  drop  break  up  is  We  >  6,  where  We  is  the  Weber  number  of  the  drop.  For  such  a  drop, 
first  it  pancakes  and  then  the  radius  increases  exponentially  with  time  until  breakup.  If 
We  <  6,  the  drop  pancakes  but  then  oscillates  around  a  mean  radius.  For  a  drop  falling  in 
static  air,  the  Weber  number  is  defined  as  the  ratio  between  the  gravitational  force  on  the 
droplet  and  the  surface  tension  on  the  drop: 


We  = 


pgd 2 

a 


(1) 
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(a) 


19.25  hrs 


(b) 

Figure  4:  Experiment  described  in  section  3.2  after  19.25  hours,  (a)  shows  a  photo  of  the 
sheet,  with  the  exposure  manipulated  to  highlight  the  salt  crystals,  (b)  shows  the  salt 
coverage  ratio  versus  radius,  calculated  from  the  image  in  (a),  showing  a  clear  exponential 
tail  to  the  distribution. 


174 


(a)  (b) 


Figure  5:  Experiment  as  described  with  Pyrex  beaker  to  represent  a  stalagmite,  (a)  shows 
the  beaker  in  place  on  the  aluminium  sheet,  (b)  shows  a  manipulated  photo  to  highlight 
the  salt  crystals  present  after  21.5  hours. 


0  50 

0i  (cm) 


Figure  6:  Figures  from  [9],  showing  a)  the  measured  height  of  the  source  above  the  rings  vs 
the  radius  of  the  ring,  for  the  in  situ  rings  and  the  artificial  rings  (large  red  symbols).  The 
solid  black  line  shows  the  best  fit  2nd  degree  polynomial,  b)  shows  the  distribution  of  the 
difference  between  the  best  fit  height  and  the  actual  height. 
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where  p  is  the  density  of  the  water,  g  is  gravitational  acceleration,  d  is  the  diameter  of 
the  drop  and  <7  is  the  surface  tension  between  the  water  and  the  air.  For  a  pendant  drop, 
such  as  one  forming  on  a  stalactite,  at  detachment  the  gravitational  force  on  the  drop  just 
exceeds  the  surface  tension,  i.e. 

\ngpd 3  ~  nDa,  (2) 

fa 

where  D  is  the  diameter  of  the  source.  Combining  (1)  and  (2)  gives  We  ~  6 D/d.  and  thus 
the  condition  for  the  drop  to  breakup  becomes  D  >  d,  i.e.  the  drop  must  detach  whilst  its 
diameter  is  less  than  that  of  the  source.  This  is  physically  unlikely  for  a  detaching  pendant 
drop,  see  for  example  [4]  and  figure  7,  and  implies  the  drop  would  initially  have  a  diameter 
of  over  7cm. 


Figure  7:  Depiction  of  a  pendant  drop  from  a  stalactite.  D  is  the  diameter  of  the  source  of 
liquid  and  d  is  the  diameter  of  the  drop  itself. 


4.2  Experiment 

To  disprove  the  ‘fall  down’  conjecture  experimentally,  the  setup  described  in  section  2  was 
used,  with  the  source  raised  2m  above  the  sheet,  and  a  20%  salt  solution  was  pumped  at 
a  rate  of  17.7  ±  0.5  ml  per  hour.  A  plastic  beaker  was  placed  on  the  sheet  directly  below 
the  source,  in  order  to  catch  the  primary  drop.  The  experiment  was  run  for  22  hours,  after 
which  the  beaker  contained  the  majority  of  the  water;  a  small  amount  had  bounced  out  as 
the  beaker  began  to  fill,  see  figure  8.  No  drops  were  observed  breaking  up  throughout  any 
of  the  experiments.  It  must  be  noted  that  2m  is  at  the  lowest  end  of  the  observed  rings,  see 
figure  6,  and  that  a  higher  fall  height  would  be  more  rigorous  in  disproving  the  ‘fall  down’ 
conjecture,  however  2m  was  the  limit  of  the  equipment  available. 

5  Coffee  Ring  Theory 

A  ‘coffee  ring’  is  the  deposit  formed  when  a  sessile  drop  of  a  solution  containing  dissolved 
particles,  such  as  coffee  or  salt,  dries.  This  was  investigated  by  Deegan  et.  al  in  [3],  and 
shown  to  be  the  result  of  a  combination  of  contact  line  pinning  and  non-uniform  evaporation 
flux.  If  the  contact  line  of  a  drop  is  pinned,  by  a  slightly  rough  surface  for  example,  then, 
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Figure  8:  Experiment  as  described  with  plastic  beaker  to  catch  primary  drops,  (a)  shows 
the  beaker  in  place  on  the  aluminium  sheet,  (b)  shows  a  manipulated  photo  to  highlight 
the  salt  crystals  present  after  22  hours. 


when  volume  is  lost,  the  majority  is  taken  from  the  centre  of  the  drop,  see  figure  9.  The 
rate  limiting  step  in  evaporation  from  a  drop  is  the  diffusion  of  water  vapour  away  from  the 
saturated  region  around  the  drop.  Due  to  the  geometry  of  the  drop,  assumed  to  be  close 
to  a  spherical  cap,  the  evaporation  is  enhanced  at  the  contact  line,  where  it  is  easier  for  a 
water  vapour  particle  to  escape,  see  figure  10(a). 


Figure  9:  Depiction  of  volume  lost  from  (a)  a  drop  where  the  contact  line  is  free  to  move 
and  (b)  a  drop  where  the  contact  line  is  pinned.  [3] 

Because  of  the  mis-match  in  the  location  of  maximum  mass  loss  and  maximum  evapo¬ 
ration  flux,  see  figure  10(b),  there  is  a  constant  flux  of  fluid  towards  the  edge  of  the  drop  to 
replace  the  evaporated  water.  As  the  majority  of  the  evaporation  occurs  at  the  edge  of  the 
drop,  the  majority  of  the  dissolved  particles  are  deposited  at  the  edge  of  the  drop,  resulting 
in  the  visible  ring  left  behind. 

While  this  is  a  well  formed  theory  for  drops  of  the  size  of  a  few  cm  or  less,  it  was 
proposed  that  this  could  also  be  the  mechanism  behind  the  formation  of  cave  rings.  If  the 
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(a)  (b) 


Figure  10:  (a)  Depiction  of  a  water  vapour  particle  performing  a  random  walk  at  the  edge 
of  the  drop.  It  is  more  likely  to  escape  near  the  contact  line,  (b)  Magnitude  of  mass 
loss,  dM/dtt ,  dashed  line,  and  evaporation  flux,  J,  line,  versus  radius  of  the  drop.  The 
evaporation  flux  is  singular  at  the  contact  line.  [3] 

flow  rate  in  the  cave  was  high  enough  to  produce  a  large  puddle,  perhaps  perhaps  during 
a  rain  storm,  then  the  coffee  ring  mechanism  could  produce  a  ring  of  limestone  deposit 
around  the  edge  of  the  puddle,  forming  the  cave  ring  left  behind. 

5.1  Large  Coffee  Ring  model 

5.1.1  Large  Puddle 

To  test  the  ‘coffee  ring’  hypothesis,  we  developed  a  model  for  large  coffee  rings.  We  begin 
by  modelling  a  large  puddle,  fed  by  a  source  and  losing  water  to  evaporation,  with  a  surface 
described  by  h(r,t),  using  the  2-D  quantities  depicted  in  figure  11,  under  the  following 
assumptions: 

•  slow,  viscous  flow, 

•  shallow  puddle,  R  »  H , 

•  source  within  r  =  a, 

•  no  slip  at  z  =  0, 

•  no  stress  at  z  =  h. 

The  momentum  balance  in  the  radial  and  vertical  directions  can  then  be  written  as 
follows: 

=  *%$  + smaller,  (3) 

p  or  ozz 

1  dp 

—  —  =  g  +  smaller,  (4) 

p  oz 

where  p  (r,  z,  t )  is  the  pressure  in  the  puddle,  p  is  the  density  of  the  fluid  (assumed  con¬ 
stant),  v  is  the  molecular  diffusivity  of  the  fluid,  and  g  is  gravitational  acceleration.  These 
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Figure  11:  Depiction  of  the  quantities  described  in  the  large  puddle  model.  h(r,t )  is  the 
surface  of  the  puddle,  (u,  w)  is  the  velocity  field  withing  the  puddle,  H  and  R  are  the 
vertical  and  radial  scales  respectively,  and  a  is  the  radius  containing  the  source. 


expressions,  along  with  the  boundary  conditions  previously  mentioned,  lead  to  the  following 
definitions  of  the  pressure,  p  and  horizontal  velocity,  u: 


p  =  pg{h  -  z), 


gdh 

U  =  —  —  Z(Z  —  zh). 

v  or 


(5) 


Mass  conservation  leads  to  the  following  relation  between  the  horizontal  and  vertical  veloc¬ 
ities,  u  and  w  respectively: 


_  19.  .  dw 

V  •  v  =  —  —  {ru)  +  —  =  0. 
r  or  oz 


(6) 


Integrating  (6)  in  the  vertical: 


dz  +  w(z) 


w(  0)  =  0. 


The  dynamical  boundary  conditions  for  w  are  as  follows: 


(7) 


w(z  =  h) 
w(z  =  0) 


dh  dh 
~di  +  U'fr  Z=h  +  W' 

H^a  v  )wsourcei 


evapi 


(8) 

(9) 


where  wevap  and  wsource  are  the  vertical  velocities  of  fluid  leaving  via  evaporation  and 
entering  from  the  source  respectively,  and  H(a  —  r)  is  a  Heaviside  function.  We  have 
assumed  here  that  the  source  is  placed  at  z  =  0,  but  the  results  that  follow  are  the  same  if 
the  source  is  instead  assumed  to  be  at  z  =  h.  If  we  evaluate  (7)  at  z  =  h: 


,  dh 
dz  +  u-  — 
or 


z—h 


dh 

H  7^  1“  'Wevap 


1  d  .  fh  ,  ,  dh 
r¥r{ri  Udz)+M 


+  W, 


evap 


H  ( Q,  T  )  WSOurce 
H  {(I  T  ^W  source 


0, 

0. 


(10) 

(11) 


The  form  of  wevap  can  be  found  from  Hu  and  Larson[6],  who  expanded  on  Deegan  et  al.  [3] 
to  find  an  exact  form  for  the  evaporation  flux  over  a  sessile  drop.  It  turns  out  that  solving 
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for  the  flux  is  equivalent  to  the  problem  of  deriving  the  electrostatic  potential  formed  by 
the  union  of  two  spherical  caps,  as  previously  solved  by  Lebedev[7].  Taking  the  form  for  a 
spherical  cap  with  contact  line  angle  equal  to  zero,  assuming  that  for  a  large  puddle  that 
this  will  be  approximately  correct,  then 


Wevap  — 


K  = 


2Dpv(l  -  9) 

Tip 


(12) 


where  R  is  the  radius  of  the  puddle,  D  is  the  molecular  diffusivity  of  water  vapour  in  air, 
pv  is  the  saturated  vapour  density  of  water,  p  is  the  density  of  liquid  water,  and  6  is  the 
ambient  relative  humidity. 

Combining  (5),  (11)  and  (12),  the  evolution  of  the  puddle  height  can  be  written  as 
follows: 


dh  TT(  \  Q  «  ,  2g  1  d  (  dh 

m=H{a- r)rf  -  7W=, is  +  T,-rTr  V  Trh  )  ■ 

where  q  is  the  source  flux  in  m3/s.  Assuming  a  steady  state,  i.e.  dh/dt 
solve  for  the  height  profile  of  the  puddle,  h: 


(13) 


0,  then  we  can 


h(r) 

9(r) 


21/4 


g(r)  +  H(r  —  a) 


log 


1  -vT^ 


-  V7!  -  r2, 


(14) 

(15) 


where  the  parameters  have  been  non-dimensionalised  as  follows:  h  — >  h/H,  r  — ►  r/R , 
a  — »  a/R.  The  non-dimensional  height  and  radial  scales  are  defined  as  follows: 


R  = 


Q 

2TTK1 


(16) 


The  form  of  h(r)  can  be  seen  in  figure  12  for  a  =  0.1. 


5.1.2  Solute  Dynamics 


We  now  include  the  concentration  of  a  passive  solute,  such  as  salt  or  coffee,  c(r,z,t),  and 
consider  a  section  of  the  puddle  from  n  to  r2  and  z  =  0  to  z  =  h,  as  depicted  in  figure  13. 
If  we  require  conservation  of  c  over  the  section,  then  this  implies: 


d_ 

dt 


rr2  rh(r,t) 


1 7*1  JO 


c(r,  z,t )  dz  r  dr 


rh(r,t)  r 2  rr2 

r  /  Jr(r,z,t)dz  +  /  rc(r,0,t)w(r,0,t)  dr, 

Jo  Jr i 

J  n 


(!7) 

where  Jr{r,z,t )  is  the  radial  flux  of  solute,  and  the  solute  arrives  through  the  bottom  with 
flux  Jz{r,z  =  0,  t)  =  w(r,z  =  0 ,t)c(r,z  =  0 , t)  =  wsourcecsource,  but  the  solute  cannot  leave 
the  puddle  through  evaporation,  and  so  the  flux  of  concentration  at  z  =  h  is  zero.  If  we 
rewrite  (17)  as  a  single  integral  over  (ri,^),  set  the  integrand  to  zero  since  the  limits  are 
arbitrary,  and  use  the  definition  of  J  =  cu  —  \V c,  we  get 


d_ 

dt 


f*h(r,t) 


c(r,  z,  t)  dz 


ld_ 

r  dr 


fblrt)  f  Qc\ 

r  J  \c(r,z,t)u{r,z,t)  —  Xr~Q~ J  dz 


+  cs 


source^  source 


(18) 
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Figure  12:  The  steady  state  solution  h(r),  the  height  of  the  puddle,  see  (14),  for  a  =  0.1. 


Figure  13:  Depiction  of  section  of  puddle  considered  in  section  5.1.2. 
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where  x  is  the  molecular  diffusivity  of  the  passive  solute. 

If  we  now  consider  the  dimensions  of  each  of  the  terms  in  the  advection-diffusion  equa¬ 
tion,  then  we  find  the  following  balance: 


M  +  7&(™c)  +  &(wc) 

uc 

R 


y  lJL(rQc 

A  r  gr  V  Or 
Xc 

W 


)  + 


yd!c 
XqZZ, 

I  Xc 

+  H?, 


(19) 


where  the  bottom  row  indicates  the  dimension.  The  second  term  on  the  RHS  dominates 
on  that  side  due  to  the  shallow  puddle  assumption  made  previously,  R  »  H.  Thus,  the 
ratio  between  the  left  and  right  sides  becomes  uH2 / Rx  =  H/R. Pe,  where  Pe  is  the  Peclet 
number  of  the  fluid.  The  Peclet  number  is  likely  to  not  be  large  in  a  dripping  cave  fluid,  and 
so  the  vertical  diffusion  term  clearly  dominates.  Thus,  we  can  assume  that  vertical  diffusion 
acts  fast  enough  to  homogeonize  the  solute  over  the  depth  of  the  fluid,  i.e.  c(r,  z,  t )  ps  c(r,  t), 
and  we  find  the  following  expression  for  the  evolution  of  the  concentration  flux  from  (18): 


—  —  (rcu)  +  H  (a 
r  or 


r)wscs  +  x 


ld_ 

r  dr 


(20) 


where  the  subscript  s  denotes  source  properties,  and  u  is  the  vertically  integrated  horizontal 
velocity  u.  From  now  on  we  will  drop  the  bar  over  c. 

If  we  first  consider  the  simpler  case  of  the  steady  state,  dh/dt,  then  we  can  rewrite  (20) 
as  follows: 


h^~  =  H(a  -  r)%  + 
dt  az  r  dr 


dh  3  dc 
r  I  c—hA  +  K  —h 
Or  Or 


(21) 


where  K  =  nxH/q,  r  =  Hq/ Attk2,  and  the  parameters  have  been  non-dimensionalised  as 
follows:  c  — ►  c/c o,  cs  —>  cs/cq,  t  —>  t/r,  and  h  and  r  as  previously,  where  Co  is  the  initial 
concentration  of  the  solute  within  the  puddle,  and  h  is  described  in  (14). 

Solving  this  numerically  as  an  initial  value  problem  using  the  pdepe  routine  in  Matlab 
2010a,  the  results  up  to  t  =  0.9  can  be  seen  in  figure  14,  where  a  =  0.1,  cs  =  1  and  K  =  0.01. 
It  can  be  seen  that,  as  expected,  the  solute  is  swept  to  the  edge  of  the  puddle  over  time, 
where  it  collects. 

(21)  can  also  be  solved  analytically  in  certain  regimes.  It  can  be  seen  from  figure  14 
that,  away  from  the  edge  of  the  puddle  (r  =  1),  the  solution  for  c(r)  approaches  a  single 
solution  over  time.  To  solve  for  this  solution  we  thus  set  ff  =  0.  We  also  assume  that,  as 
the  gradients  in  c  are  not  large  away  from  r  =  1,  that  diffusion  can  be  neglected  ( K  being 
small).  We  are  thus  left  with  the  following  expression  to  solve: 


1  d  (  dh  , .. 
r¥r  {rCfrh 


(22) 


Using  (14),  we  can  evaluate: 


1 

2 


Vl~r2  +  H{a 


(23) 


and  so  can  solve  (22),  using  the  boundary  condition  of  zero  flux  at  r  =  0,  and  matching 
fluxes  at  r  =  a,  to  find: 


c  « 


C-si 

Cs/\/T 


—  r“ 


r  <  a,  K  «  0, 
r  >  a,  K  «  0. 


(24) 
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Figure  14:  Solutions  of  (21)  for  t  =  0  to  0.9,  with  a  =  0.1,  cs  =  1  and  K  =  0.01. 


Thus,  the  solution  matches  the  source  concentration  within  the  source  area,  and  outside 
follows  the  evaporation  profile  (12).  Comparing  with  the  numerical  solution,  see  figure  15,  it 
can  be  seen  that  the  analytical  solution  is  best  for  low  r,  and  becomes  worse  as  the  gradient 
of  c  increases,  and  diffusion  becomes  more  important.  At  values  of  r  close  to  to  1  and  large 
times,  the  numerical  solution  for  c  becomes  linear,  see  figure  16. 

Thus,  we  can  assume  that  c(r,t)  takes  the  form  A(r)t  +  B(r )  near  r  =  1.  Introducing 
this  into  (21),  we  get  two  coupled  equations  describing  A  and  B: 


0 

hA 


ld_ 

r  dr 
ld_ 
r  dr 


.dh  3  dA 

A—hs  +  K—h 
dr  dr 

-dh  3  <9-B 

B~rr-  h'  +  K——h 

or  or 


(25) 

(26) 


We  initially  concentrate  on  (25),  and  redefine  r  in  terms  of  the  parameter  £,  which  is  small 
near  the  puddle  edge:  r  =  1  —  Ka(.  We  can  approximate  h  near  r  =  1,  and  find: 

/  o  \  1/4  /  9  \  1/4 

]  23/8(l  -  r)3/8  =  (  |  J  23/8/C3“/8C3/8,  (27) 


and  from  (23)  we  can  rewrite: 
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V2 


Ka'2  (I/2, 
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Figure  15:  Comparison  of  the  analytical  and  numerical  solutions  for  c(r)  at  large  times 
away  from  r  =  1. 


Figure  16:  Numerical  solutions  for  c(t )  at  various  radii.  Close  to  r  =  1,  the  concentration 
becomes  linear  with  time. 
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and  so  (25)  can  be  rewritten: 


r  ( A^h3  +  K^h]  «  -A-^=Ka/2(1/2  +  Kl~a^  ( -)  ^  23/*K3a/\3/3  =  0,  (29) 

\  or  or  J  V  2  dQ  \  3  J 

where  we  have  integrated  both  sides.  The  integration  constant  is  zero,  as  the  expression  is 
equal  to  zero  at  r  =  0.  Rearranging: 

0)  V4  2 VSKl~a^  =  K^AC1'2,  (30) 

and  choosing  a  =  8/9,  we  can  integrate  to  find 

A{r)  =  k  exp  j -0^(1  -  r)9/8|  ,  (31) 


where  A;  is  a  constant  to  be  determined. 

Considering  the  form  of  the  solution,  A(r)t  +  B(r),  we  know  that  A(r)  vanishes  away 
from  the  edge  of  the  puddle,  as  we  have  already  shown  that  there  is  no  time  dependence 
to  the  solution  within  the  interior.  Thus,  B{r)  must  match  the  solution  at  the  interior  at 
some  point  away  from  the  edge.  Equivalently,  the  total  flux  associated  with  B  must  match 
the  total  flux  of  the  interior  solution.  Thus  we  can  find  the  constant  k  by  evaluating  both 
sides  of  the  following  expression,  a  rewriting  of  (26): 


p  OO  p  OO  Q 

/  rhAKad(=  /  — 

/  o  Jo  d( 


r  |  A*  +  A 

or  or 


d(  =  interior  flux,  (32) 


or 


k  (|) 1/4  jT>(l  -  K>'\)  exp  |  ■ J  A  d(  =  -C/2,  (33) 


which  can  be  re-arranged  to  find  k(cs,K );  the  full  expression  can  be  found  in  appendix  A. 
We  compare  this  with  the  numerical  solution,  see  figure  17,  using  a  logarithmic  axis  for  the 
concentration.  The  solution  is  fairly  accurate  above  r  =  0.85,  improving  closer  to  r  =  1  as 
would  be  expected. 


5.1.3  Deposition  Dynamics 

In  order  to  simulate  the  production  of  speleothems,  we  now  include  deposition  via  a  sink 
term  in  the  concentration  evolution  equation,  assuming  immediate  sedimentation  to  the 
bottom.  We  must  also  include  an  equation  for  the  evolution  of  the  depth  of  the  deposit, 
d(r,t),  and  therefore  allow  the  height,  h(r,t)  to  evolve  also.  This  results  in  a  system  of 
three  coupled  equations: 
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Figure  17:  Comparison  of  the  analytical  solutions  (31)  and  (24)  for  the  concentration,  c,  at 
t  =  100,  with  the  numerical  solution  near  r  =  1.  As  before,  cs  =  1,  a  =  0.1  and  K  =  0.01. 
The  solution  for  r  ~  1  clearly  improves  as  it  approaches  1. 
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(34) 

(35) 

(36) 


where  csat  is  the  saturation  concentration  of  the  solute,  p  is  a  precipitation  rate,  s  is  a 
size  scale  for  the  concentration  of  crystals  per  depth  of  deposit,  R(t)  is  the  boundary  of 
the  puddle  and  L  =  ttk, /q.  h  and  c  parameters  have  been  non-dimensionalised  as  before, 
however  the  time  and  radial  parameters  have  been  non-dinrensionalized  as  follows:  r  — >  r/A, 
a  a/ A,  R  — >  R/ A,  t  —>  t/a,  where  A2/ct  =  q/itH.  For  limestone  (Calcium  Carbonate), 
p  is  a  function  of  temperature,  COo  pressure,  and  the  depth  of  the  fluid,  and  we  take  the 
value  from  [2],  The  evolution  equation  for  the  depth  of  liquid,  (h  —  d),  clearly  contains 
a  singularity  at  r  =  R  in  the  evaporation  term.  This  unfortunately  makes  the  system 
impossible  to  solve,  as  previously,  using  the  pdpe  routine  in  Matlab.  Due  to  the  limited 
time  available  on  the  project,  it  was  not  possible  to  develop  an  alternative  method  for 
finding  a  numerical  solution  before  completion.  It  must  also  be  noted  that  this  system 
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Figure  18:  Numerical  and  analytical  solutions  to  (34),  where  the  solid  blue  line  is  the 
concentration  profile  c(r)  at  t  =  10. 


now  has  variable  boundaries,  and  r  =  1  does  not  necessarily  correspond  to  the  edge  of  the 
puddle,  and  so  any  alternate  method  for  solving  the  system  would  need  to  consider  the 
variation  in  the  outer  boundary,  R(t). 

We  can,  for  small  values  of  d,  make  the  approximation  h  »  d,  i.e.  h  —  d  ~  h,  and 
so  again  use  use  the  steady  state  form  of  h  derived  in  section  5.1.1,  under  the  assumption 
dh/dt  =  0.  We  then  calculate  the  depth  profile,  d(r,t),  independently  from  the  concentra¬ 
tion  profile,  c(r,t).  The  resulting  concentration  profile  at  t  =  10  can  be  seen  in  figure  18, 
where  cs  =  1,  csat  =  1-1,  a  =  0.1,  and  K  =  0.01.  For  c  <  csat,  the  same  analytical  solutions 
apply  as  previously,  (24).  For  c  >  csat,  the  steady  state  solution  is  c  =  csat ■  It  is  possible 
to  find  the  transition  point  at  which  c  =  csat  from  (24): 


T{cSat)  — 


2 


and  the  analytical  solutions  can  be  seen  plotted  in  figure  18.  It  can  be  seen  that  the 
analytical  solution  doesn’t  hold  near  r  =  1,  where  the  advection  and  diffusion  terms  become 
important.  The  resulting  depth  profile,  d(r)  can  be  seen  in  figure  19,  along  with  the  height 
profile  h(r),  where  the  deposit  of  solute  has  clearly  created  a  ring  shape. 


5.2  Experiments 

To  test  the  ‘coffee  ring’  theory  we  first  lowered  the  source  to  within  a  few  mm  of  the 
aluminium  sheet  in  order  to  first  ‘grow’  a  puddle  to  test  the  concept.  Initially,  the  results 
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Figure  19:  Solute  concentration,  blue,  deposit  depth,  green,  and  liquid  height,  black,  profiles 
at  t  =  10.  Note  that  the  deposit  depth  profile  has  been  multiplied  by  104  for  clarity. 


looked  promising.  Figure  20  shows  the  ring  of  salt  produced  by  a  puddle  a  few  cm  in 
diameter.  However,  increasing  the  rate  of  flow  to  grow  puddles  of  the  size  required  to 
explain  the  cave  rings  did  not  produce  the  results  hoped  for.  Past  a  few  cm,  growth  is 
dominated  by  surface  tension  effects  at  the  contact  line  of  the  puddle,  see  figure  21.  Whilst 
coating  the  aluminium  sheet  in  vegetable  oil  to  reduce  this  effect  did  result  in  a  smoother 
boundary  to  the  puddle,  it  still  was  not  possible  to  produce  a  large,  round  puddle.  Given 
that  the  aluminium  sheet  was  kept  as  flat  as  was  possible,  and  that  cave  floors  are  hardly 
uniform  surfaces  (see  figure  1),  this  would  seem  to  disprove  the  ‘coffee  ring’  hypothesis.  It 
was  possible  to  find  a  flow  rate  at  which  evaporation  matched  the  source  flux,  see  figure  22, 
but  this  was  definitely  not  a  circular  puddle. 

The  second  set  of  experiments  involved  raising  the  source  by  2m  in  order  to  see  if  the 
dynamics  of  the  source  dropping  from  above  would  affect  the  puddle  formation.  The  source 
now  arrived  in  drops  of  just  under  1  ml  in  volume,  which  is  similar  to  the  volumes  mea¬ 
sured  in  situ  by  Nozzoli  et  al.  [9] .  It  was  found  that,  because  the  fraction  of  mass  ejected  as 
secondary  drops  was  quite  high,  a  higher  flow  rate  was  required  to  produce  a  puddle  of  a 
substantial  size  underneath  the  source.  The  correlation  between  puddle  size  and  flow  rate 
was  expected  from  our  model,  however,  while  a  substantial  amount  of  crystals  formed  from 
the  secondary  splashes,  none  formed  on  the  puddle  itself,  see  figure  23.  It  is  speculated 
that,  from  observations  of  the  experiment  in  process,  the  large  amount  of  kinetic  energy  im¬ 
parted  into  the  puddle  by  each  drop  effectively  mixed  the  puddles  so  that  the  concentration 
profile  predicted  did  not  evolve,  the  solution  within  the  puddles  never  exceeding  saturation 
concentration  once  equilibrium  was  reached.  As  with  the  previous  splashing  experiments, 
see  section  3.2,  the  salt  splash  formed  a  distribution  with  an  exponential  tail,  see  figure  24. 

We  speculated  that  perhaps  the  porosity  of  the  cave  floor  might  dictate  the  formation 
of  the  cave  rings,  and  so  repeated  the  experiment  with  piece  of  black  cloth  on  top  of  the 
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Figure  20:  Initial  experiment  with  the  source  close  to  the  sheet,  producing  a  small  ring  of 
salt.  10%  salt  solution  was  pumped  at  a  rate  of  1.4  ±  0.2  ml  per  hour  for  46.5  hours 


(a)  (b) 


Figure  21:  Two  experiments  that  show  the  difficulties  in  creating  a  perfectly  circular  puddle, 
both  with  a  10%  salt  solution  pumped  at  a  rate  of  14.7  ±  0.7  ml  per  hour,  (a)  shows  a 
puddle  after  2.25  hrs  with  the  original  setup,  (b)  shows  a  puddle  after  9  hrs,  where  the 
sheet  has  had  a  thin  layer  of  vegetable  oil  applied  to  reduce  surface  tension  effects. 
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Figure  22:  Experiment  with  flow  rate  of  6.4  ±  0.4  ml  per  hour  after  5.25  hrs.  Equilibrium 
has  been  reached  and  salt  crystals  are  being  deposited  at  the  edge  of  the  puddle,  but  the 
puddle  is  not  circular. 


Figure  23:  Experiment  with  source  ~2m  above  the  sheet,  with  10%  solution  being  pumped 
at  a  rate  of  17.17  ±  0.5  ml  per  hour,  after  26.75  hrs.  The  salt  deposits  are  produced  by 
secondary  droplets  ejected  from  the  puddle,  which  can  be  seen  in  the  centre  of  the  picture. 
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Figure  24:  Salt  coverage  vs  radius  for  figure  23  with  an  exponential  curve  fitted.  There  is 
a  ~  10  cm  ring  where  the  area  is  saturated. 


Figure  25:  Experiment  as  previously,  with  black  cloth  covering  the  sheet,  after  21.5  hrs. 
Salt  has  been  deposited  at  the  boundary  of  the  wetted  region,  but  the  shape  is  non-uniform. 
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sheet,  in  order  to  test  whether  the  spread  of  the  puddle  through  a  porous  medium  might 
be  more  uniform.  The  resultant  salt  formation  can  be  seen  in  figure  25.  A  ring  has  formed, 
as  the  previous  mixing  of  the  solution  within  the  puddle  is  not  possible,  however  the  shape 
is  still  non-uniform. 

5.3  Speleothem  Science 

As  it  appears  that  the  coffee  ring  theory  is  not  able  to  describe  the  large  cave  rings  observed, 
we  turned  to  the  science  of  cave  formations,  or  speleothems.  There  is  a  huge  variety  of 
formations,  but  the  basic  method  of  formation  is  the  same  for  most,  depicted  in  figure  26: 

•  Rain  water  seeps  down  through  the  top  layer  of  soil,  dissolving  Carbon  Dioxide  to 
form  Carbonic  acid. 

•  On  contact,  the  acid  dissolves  the  bedrock,  normally  Calcium  Carbonate. 

•  The  solution  comes  into  contact  with  air  on  meeting  a  cave,  and  the  dissolved  stone 
is  then  deposited  out. 


Figure  26:  Depiction  of  the  formation  of  speleothems,  through  the  formation  of  Carbonic 
acid  and  the  dissolving  of  Calcium  Carbonate. 

The  overall  chemical  reaction  involved  is  as  follows: 

CaC03  +  C02  +  H20  ^  Ca2+  +  2HCO^ .  (37) 
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The  equilibrium  for  the  reaction  is  towards  the  right,  the  ionic  solution,  when  the  solution  is 
within  the  rock,  not  in  contact  with  air.  However,  when  the  solution  is  in  contact  with  the 
air,  the  chemical  equilibrium  moves  to  the  left,  and  so  CO2  is  out-gassed  quickly  and  CaCOj, 
deposited,  as  the  solution  is  over-saturated.  This  is  the  case  for  the  majority  of  formations, 
although  in  some  situations,  such  as  in  highly  humid  regions  or  when  the  path  the  liquid 
takes  is  not  sealed  from  air,  the  solution  is  under-saturated.  The  modelling  for  this  project 
assumed  that  the  solution  was  under-saturated,  as  is  the  case  for  the  salt  solution  used  in 
the  experiments.  As  cave  rings  are  relatively  rare,  it  could  be  that  they  are  in  one  of  the 
unusual  situations  mentioned,  however  there  is  currently  no  data  on  the  concentrations  of 
Calcium  Carbonate  in  the  solutions  forming  cave  rings. 

It  is  possible  to  solve  the  model  described  in  section  5.1.3  under  the  condition  that 
the  source  concentration  is  greater  than  the  saturation  concentration,  i.e.  cs  >  csat ■  The 
resulting  solution  at  t  =  10  for  csat  =  0.5,  cs  =  1,  is  shown  in  figure  27.  It  can  be  seen  that, 
although  some  of  the  solute  is  still  swept  to  the  edge  of  the  puddle,  most  is  deposited  in 
the  centre,  where  the  source  is.  The  resultant  deposit  is  shaped  like  a  classic  stalagmite. 


Figure  27:  Numerical  solution  with  csat.  =  0.5,  cs  =  1,  K  =  0.01,  and  a  =  0.1,  at  t  =  10. 
The  deposit  depth,  green  line,  has  been  scaled  by  103  to  allow  for  comparison. 


6  Conclusion 

This  project  set  out  to  test  the  hypotheses  set  out  in  Nozzoli  et  al.  [9]  for  the  formation  of 
cave  rings,  namely  the  ‘splash  ring’  and  ‘fall  down  ring’  hypotheses,  and  to  test  our  ‘coffee 
ring’  theory.  It  was  shown  clearly  that  both  the  ‘splash’  and  ‘fall  down’  hypotheses  were 
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unverifiable  experimentally. 

A  mathematical  model  for  large  coffee  rings  was  developed,  and  numerical  solutions  that 
matched  our  expectations  for  ring  formation  were  produced.  Unfortunately,  experimenta¬ 
tion  showed  that  the  flaw  in  our  theory  was  that  the  formation  of  large,  circular  puddles 
seems  to  be  impossible  on  a  realistic  surface,  as  surface  tension  at  the  contact  line  seems  to 
dominate  while  the  puddle  grows,  producing  non-uniform  shapes.  Experiments  also  showed 
that  the  kinematics  of  drops  arriving  from  2m  produces  significantly  different  sized  puddles 
than  those  where  the  source  was  close  to  the  sheet.  The  majority  of  cave  formations  re¬ 
sult  from  solutions  that  are  over-saturated  in  Calcium  Carbonate  when  in  contact  with  air, 
whereas  our  experiments  used  under-saturated  salt  solution. 

It  seems  clear  that  coffee  rings  are  unlikely  to  be  the  formation  method  for  cave  rings, 
although  there  is  still  a  little  work  to  be  done  before  it  can  be  ruled  out  completely.  Due 
to  the  limitations  of  the  equipment  and  time  available,  it  wasn’t  possible  to  create  puddles 
larger  than  c.  10  cm  when  the  source  was  raised  2m  above  the  sheet.  Using  a  higher  source 
flux,  it  should  be  possible  to  see  if  the  kinematics  of  the  arriving  drops  affect  the  shape  of 
larger  puddles.  Also,  to  rule  out  the  ‘fall  down  ring’  hypothesis  more  comprehensively,  an 
experiment  could  be  designed  where  the  source  passes  through  a  hole  in  a  sheet,  and  the 
water  collected  below,  in  order  to  show  that  no  drops  split  and  land  on  the  sheet. 

In  general,  there  appears  to  be  a  dearth  of  mathematical  work  in  the  area  of  speleothem 
formation,  with  an  astounding  range  of  speleothems,  both  beautiful  and  bizarre,  still  not 
understood.  The  development  of  a  more  accurate  mathematical  model  for  such  formations 
could  only  be  of  benefit  to  the  geological  community. 
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A  Full  Concentration  Solution 

In  section  5.1.2,  the  time  dependent  part  of  the  solution  to  (21)  close  to  the  edge  of  the 
puddle  at  large  t,  A,  is  set  out  in  (31),  where  the  constant,  k,  is  given  by  (33).  Rearranging, 
we  find 

ql/4  r  poo  (  ol5/8  'l 

k(csat,K )  =  -^Csat  K11/9  (3/8  exp  |  - ^yrC9/8  MC 

roo  (  ol5/8  J  I  — 1 

-Kla,SJo  CI1/8expj-LFC9/8|*;J  ,  (38) 

ql/4  r  -i  _i 

~  [1.74R11/9  -  3.48R19/9]  ,  (39) 
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1  Introduction 

A  classical  problem  in  fluid  mechanics  is  that  of  a  jet  impinging  on  a  solid  surface.  During 
the  1960’s,  partly  driven  by  applications  in  the  steel  industry,  the  exploration  of  this  field 
of  study  shifted  towards  the  interaction  between  a  gas  jet  and  a  deformable  liquid  surface 
(Figure  1),  where  often  the  shape  of  the  deformation  and  its  relation  to  the  balance  of  forces 
were  studied.  This  is  also  a  very  simplified  representation  of  the  interaction  between  the 
wind  and  the  ocean. 


Figure  1:  Schematic  drawing  of  the  experimental  set  up.  Air  is  blown  from  a  nozzle  at  right 
angles  with  a  still  water  surface  in  a  tank. 

The  first  report  of  work  on  this  subject  is  by  Banks  and  Chandrasekhara  (1),  who 
performed  experiments  with  an  air  jet  impinging  on  a  water  surface.  They  identified  three 
regimes:  a  steady  cavity,  an  oscillating  cavity,  and  splashing.  Only  the  first  of  these  three 
regimes  was  studied  in  more  detail  and  scaling  approaches  were  suggested  to  establish  a 
relation  between  the  impact  of  the  jet  and  the  depth  of  the  cavity.  A  few  years  later 
Turkdogan  (9)  performed  similar  experiments,  but  included  liquids  of  different  densities 


196 


iii  the  analysis  and  especially  focused  on  the  relation  between  the  nozzle  height  and  the 
depth  and  width  of  the  cavity.  Cheslak  et  al.  (3)  also  studied  the  geometric  properties  of 
a  steady  air-blown  dimple  and  found  simple  relations  predicting  the  depth  and  diameter  of 
the  cavities. 

A  different  approach  was  taken  by  Olmstead  and  Raynor  (6),  who  solved  the  whole 
shape  of  steady  shallow  dimples  using  conformal  mapping.  Almost  two  decades  later,  new 
numerical  procedures  enabled  Vanden-Broeck  (10)  to  study  slightly  deeper  cavities  with 
this  method,  but  a  conformal  mapping  approach  to  this  problem  without  limitations  to  the 
depth  of  the  cavity  was  only  provided  recently  by  He  and  Belmonte  (5). 

Although  instability  of  the  cavity  at  higher  jet-impact  velocity  was  already  reported 
by  Banks  and  Chandrasekhara  (1),  surprisingly  little  attention  has  been  given  to  this  phe¬ 
nomenon.  The  available  literature  is  restricted  to  predicting  the  onset  of  instability,  using 
energy  balances  to  determine  the  decisive  factors.  A  pioneering  study  by  Rosier  and  Steward 
(8)  identified  two  unstable  regimes  in  their  experiments:  an  unstable  oscillating  regime  and 
a  splashing  regime.  In  the  former  regime  three  modes  of  oscillation  were  identified  which 
were  described  qualitatively  (Figure  2).  The  onset  of  the  unstable  and  splashing  regimes 
were  shown  to  depend  on  a  critical  jet  velocity,  with  a  strong  dependence  on  the  interfacial 
surface  tension.  Berghmans  (2)  showed  in  a  theoretical  study  that  increasing  liquid  density 
or  surface  tension  is  stabilizing,  while  increasing  the  gas  density  is  destabilizing.  The  jet 
radius  was  also  found  to  be  important. 


E-mode 


V-mode 


B-mode 


Figure  2:  The  three  modes  of  instability.  In  the  E-mode  the  tip  of  the  dimple  traces  out  an 
ellipse.  In  the  V-mode,  the  tip  oscillates  in  the  vertical  direction  only.  In  the  B-mode,  the 
tip  rolls  up  the  side  of  the  dimple. 

The  reason  behind  the  sparse  coverage  of  this  very  interesting  problem  is  likely  in  the 
limitations  imposed  by  the  available  observational  equipment.  While  with  the  cameras  used 
four  decades  ago  this  regime  was  very  difficult  to  study  experimentally,  today’s  high-speed 
cameras  offer  the  opportunity  to  capture  the  movements  of  the  cavity  and  study  them  in 
detail.  To  our  knowledge  there  is  one  study  (4)  in  which  an  attempt  was  made  to  study 
the  oscillations  of  the  cavity  using  a  video  camera,  but  no  conclusions  could  be  drawn  from 
these  results.  Here,  we  use  a  high  speed  camera  to  obtain  movies  of  unstable  dimples  in  a 
variety  of  regimes  for  an  air-water  system.  We  explore  the  dependence  of  the  regimes  on 
key  nondimensional  parameters  and  study  the  spectral  properties  of  the  dimple  oscillations. 

The  experimental  set  up  and  image  processing  are  discussed  in  Section  2.  The  results 
are  first  discussed  from  a  qualitative  point  of  view  in  Section  3.  Then  a  quantitative  analysis 
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is  given  in  Section  4,  on  the  geometric  properties  of  the  dimple  as  well  as  a  spectral  analysis 
of  the  dimple  oscillations.  Section  5  discusses  the  results  and  gives  an  overview  of  the 
conclusions  from  this  study. 


2  Method 

2.1  Experimental  set  up 

The  jet  in  the  experiment  was  produced  by  attaching  the  wall  air  supply  to  a  nozzle,  which 
was  fixed  to  an  adjustable  frame  above  a  layer  of  water  in  a  tank.  The  circular  nozzle  had 
an  inner  diameter  of  1.35  mm.  The  square  Plexiglas  tank  was  11.2  cm  in  height  and  20  cm 
wide.  The  water  depth  in  the  tank  was  7  cm  unless  stated  otherwise.  To  regulate  the  air 
flow  rate  in  the  jet,  a  pressure  meter  (Digital  air  regulator,  Central  Pneumatic  Professional, 
model  no.  98426)  was  used.  The  pressure  was  calibrated  with  a  volume  rate  meter  (Omega 
Engineering  Gas  Mass  Flow  Meter  FMA  1818)  to  obtain  the  flow  rate  Q.  The  frame  to 
which  the  nozzle  was  attached  could  be  moved  up  and  down  to  adjust  its  height  above  the 
water  surface;  the  position  could  be  read  from  a  calibrated  scale.  The  fluid  properties  of 
interest  to  this  experiment  are  the  kinematic  viscosity  of  the  gas  ( ug  =  15.68  •  ICC6  m2  s'1), 
the  densities  of  the  gas  and  the  liquid  (pg  =  1  kg  nr"3;  pi  =  1000  kg  m"3),  and  the  interfacial 
surface  tension  (a  =  71.97  dyn  cm"1  =  71.97  •  10~3  N  nr"1).  The  data  acquisition  is  done 
with  a  digital  high-speed  camera  (Phantom  v5.0)  at  a  3000-fps  frame  rate.  Only  for  the 
surface  views  (Section  3.2)  the  frame  rate  was  adjusted  to  300  fps. 


2.2  Parameter  regime 


The  two  key  nondimensional  parameters  that  are  varied  in  this  study  are  the  nondimensional 
nozzle  height  (£  =  h/d),  which  determines  the  distance  over  which  the  spreading  jet  can 
evolve,  and  the  Reynolds  number,  which  (for  fixed  Q  determines  the  impact  pressure  of  the 
jet.  The  Reynolds  number  is  here  defined  as 


Re  = 


4  Q 

7 X  d  Vg' 


and  varies  between  8.5  •  102  and  3.4  •  103.  (  varies  between  7.4  and  37.0,  equivalent  to  1  and 
5  cm  above  the  water  surface,  respectively. 

Four  other  nondimensional  parameters,  which  are  mentioned  in  literature  to  be  impor¬ 
tant  for  this  problem,  are  not  consciously  changed  but  vary  along  with  (  and  Re.  The  first 
is  the  nondimensional  depth  of  the  dimple  rj  =  hc/d  (variation  between  0.8  and  10.0).  The 
second  and  third  are  a  measure  for  the  importance  of  surface  tension.  The  Weber  number 
(2;  5),  defined  as  the  ratio  between  inertial  forces  and  surface  tension,  is  given  by 

Pa  ( l/2r<P  )  riet 

We  =  - - - - - , 

<7 

and  varies  in  our  experiments  between  1.4  and  105  The  Bond  number  (2;  5),  which  is  the 
ratio  between  gravitational  forces  and  surface  tension,  is  given  by 

D  {Pi  ~  Pa)  9  rfet 
Uo  = - , 

C 7 
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and  varies  between  0.1  and  3.  The  last  nondimensional  number  of  importance  to  this  study 
is  the  Froude  number  (1),  which  is  defined  as  the  ratio  between  a  characteristic  velocity 
and  a  gravitational  wave  velocity: 

=  (i/4^) 

VgK  ’ 

and  varies  between  107  and  127  in  our  experiments.  The  dynamics  of  the  surface  waves 
were  however  not  studied  here  quantitatively. 

2.3  Image  processing 

A  five-step  process  was  followed  in  obtaining  the  contours  of  the  cavity  from  the  movies 
(Figure  3).  First  the  black  and  white  avi  movies,  captured  with  the  high-speed  camera,  were 
loaded  into  Matlab  and  separated  into  individual  frames  (Figure  3a).  Second,  to  maximize 
the  amount  of  information  that  could  be  extracted  from  the  movie  frame,  the  contrast  in  the 
images  was  enhanced  using  CLAHE  (Contrast-Limited  Adaptive  Histogram  Equalization, 
Figure  3b).  This  technique  determines  the  histogram  of  the  grayscale  values  of  the  pixels  for 
small  regions  in  the  image  and  then  enhances  the  contrast  in  such  a  way  that  the  histogram  of 
the  output  approximately  matches  a  uniform  distribution.  Neighbouring  tiles  are  combined 
using  bilinear  interpolation  to  eliminate  artificially  induced  boundaries.  An  edge  detection 
scheme  is  applied  on  the  filtered  image  to  define  the  shape  of  the  cavity  (Figure  3c).  Note 
that  the  contrast  enhancement  step  improves  the  result  of  the  edge  detection  considerably, 
especially  tracing  the  shallow  part  of  the  cavity  (Figure  4).  The  Sobel  method  was  found  to 
give  the  best  representation  of  the  deep  as  well  as  the  shallow  part  of  the  cavity  shape.  This 
method  calculates  the  first  derivative  of  the  grayscale  intensity  in  both  x  and  y  directions, 
and  locates  the  edge  at  minimum  and  maximum  values  of  this  gradient  as  long  as  this 
value  exceeds  a  certain  threshold  value.  Last,  the  outer  contour  of  the  cavity  was  traced  by 
moving  upward  along  the  vertical  direction  in  the  frame  and  tracing  the  first  ’edge-point’ 
for  every  column  (Figure  3d).  The  shapes  thus  obtained  were  lowpass  filtered  to  reduce 
artificial  noise  introduced  in  the  detection  process,  before  statistics  such  as  the  cavity  depth, 
the  angle  with  the  vertical,  and  frequencies  were  calculated.  These  are  discussed  in  Section 
4.  For  the  spectral  analysis  the  time  series  were  first  detrended,  after  which  a  Hann  filter 
was  applied.  The  frequency  spectrum  was  calculated  using  a  FFT  algorithm. 

3  Experimental  phenomenology 

3.1  Cavity 

A  variety  of  shapes  and  movements  of  the  dimple  are  observed  in  the  experiments.  In  Figure 
5  the  sequence  of  modes  is  shown  when  the  Reynolds  number  of  the  jet  is  increased  while 
the  nozzle  height  is  fixed  at  the  lowest  position  considered  in  this  study  (£  =  7.4).  For  low 
jet  Reynolds  numbers  the  cavity  is  stable  and  slowly  deepens  when  the  jet  flow  is  increased 
(Figure  5a).  We  will  refer  to  this  state  as  the  S-mode.  At  a  critical  value  of  Re  =  1075 
instability  sets  in  and  the  tip  of  the  dimple  traces  out  an  ellipse  (E-mode,  Figure  5b).  This 
mode  only  exists  for  a  very  narrow  range  of  the  jet  Reynolds  number,  as  it  quickly  changes 
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original  image 


enhanced  contrast 


edges 


Mean  cavity  shape 


Figure  3:  From  the  original  image  frame  to  a  cavity  outline.  First  the  contrast  is  enhanced 
using  CLAHE  (b),  then  the  edge  is  detected  using  the  Sobel  method  (c).  The  contour  of 
the  cavity  is  found  from  the  detected  edges  (black  solid  line  in  (d)),  and  lowpass  filtered  for 
further  analysis  (gray  dotted  line  in  (d)).  (C  =  7.4,  Re  =  1050) 


edges  original  image 


edges  filtered  image 


(a) 


(b) 


Figure  4:  Improvement  in  the  edge  detection  results  due  to  contrast  enhancement. 


(a)  S-mode  (b)  E-mode  (c)  V-mode  (d)  B-mode  (e)  Bubbling  and 

splashing 


Figure  5:  Subsequent  stages  of  the  dimple  with  increasing  jet  Reynolds  number  for  a  fixed 
nozzle  height  (£  =  7.4).  (a)  A  stable  dimple  (Re  =  1030).  (b)  The  tip  traces  out  an  ellipse 
(Re  =  1075).  (c)  The  tip  moves  up  and  down  vertically  (Re  =  1095).  (d)  The  tip  bends 
and  rolls  up  the  side  of  the  dimple  (Re  =  1230).  (d)  Bubbling  and  splashing  (Re  =  3360). 
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into  a  purely  vertical  oscillation  of  the  tip,  here  referred  to  as  the  V-mode  (Figure  5c).  If 
the  flow  rate  in  the  jet  is  increased  even  further,  the  tip  starts  to  bend  and  roll  up  the 
side  of  the  dimple,  which  we  call  the  B-mode  (Figure  5d).  At  very  high  Reynolds  numbers 
(Figure  5e)  bubbling  and  splashing  is  observed. 


Regime  diagram 
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Figure  6:  Overview  of  the  observed  regimes  as  a  function  of  the  jet  Reynolds  number  Re 
and  the  nondimensional  nozzle  height  £. 

The  onset  of  the  next  mode  shifts  to  different  Reynolds  numbers  at  the  other  nozzle 
heights  considered  in  this  study.  The  regime  diagram  in  Figure  6  shows  the  observed  modes 
for  several  combinations  of  £  and  Re.  A  first  interesting  feature  from  this  graph  is  the 
increase  in  Reynolds  number  required  for  instability  with  £  for  the  lowest  three  values  of 
£  considered  (beneath  the  solid  black  line  in  Figure  6).  For  the  larger  nozzle  heights,  the 
required  Reynolds  number  decreases  with  £.  Although  the  measurements  are  sparser  in 
the  remainder  of  the  regime  diagram,  this  seems  to  be  a  robust  feature  as  indicated  by  the 
speculative  dashed  lines.  Also,  in  the  two  largest  £  cases  the  dimple  starts  ’shivering’  when 
it  is  still  stable,  i.e.  there  are  very  small,  noise-like  oscillations.  It  is  well  known  that  it 
takes  a  distance  of  about  30  nozzle  diameters  for  a  turbulent  jet  to  become  fully  developed 
(7).  We  therefore  believe  that  the  horizontal  solid  line  indicates  the  nozzle  height  above 
which  the  jet  shows  fully  developed  turbulence.  The  noise-like  behaviour  above  this  height 
is  then  due  to  instability  of  the  jet  rather  than  an  instability  of  the  dimple. 

A  second  result  from  this  figure  is  that  for  all  the  nozzle  heights  except  for  the  smallest 
one,  the  dimple  widens  at  high  Reynolds  number  (Figure  7b,  a  stable  small  dimple  is  given 
in  Figure  7a  for  comparison).  We  speculate  that  the  width  of  the  dimple  is  in  general 
restricted  by  the  interfacial  surface  tension,  but  that  the  high  impact  pressure  of  the  higher 
Reynolds  number  jet  is  able  to  overcome  the  surface  tension  and  creates  a  wider  dimple.  At 
very  small  £  the  dimple  displays  bubbling  and  splashing  behaviour.  It  is  expected  however 
that  the  dimple  will  enter  the  splashing  state  at  the  larger  nozzle  heights  as  well  at  even 
higher  jet  Reynolds  numbers,  which  were  not  currently  studied.  It  should  be  noted  that  the 
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modes  are  not  entirely  identical  for  the  different  nozzle  heights.  For  example,  at  (  =  24.4 
the  V-mode  is  a  very  elegant  one  and  resembles  a  Mexican  hat  (Figure  7c).  However,  the 
overall  behaviour  is  the  same. 


(a)  (b)  (c) 

Figure  7:  Snapshots  (a)  Stable  dimple:  (  =  22.2,  Re  =  990  (b)  Wide  oscillating  dimple: 
(  =  14.8,  Re  =  2270  (c)  The  ’’mexican  hat”,  a  very  elegant  V-mode:  (  =  24.4,  Re  =  1400 

In  this  study  we  focused  on  an  air-water  system.  The  influence  of  interfacial  surface 
tension  a  and  liquid  viscosity  vi  were  studied  briefly  as  well  from  a  qualitative  point  of  view 
only.  The  surface  tension  was  altered  by  adding  2-3  drops  of  dish-washing  liquid  (Ajax 
Lemon)  to  the  water  in  the  tank,  which  decreased  the  stability  of  the  dimple.  To  test  the 
influence  of  viscosity,  the  tap  water  was  replaced  by  glycerol  and  water-glycerol  mixtures. 
The  experiments  clearly  showed  increased  stability  of  the  dimple,  but  were  not  conclusive 
quantitatively. 

3.2  Surface 

While  motions  of  the  liquid  phase  are  consistently  neglected  in  analyses  presented  in  litera¬ 
ture,  the  observed  motion  shows  an  interesting  pattern.  When  the  cavity  is  in  the  S-mode 
and  thus  not  moving,  the  flow  at  the  surface  displays  a  saddle-point  like  pattern  at  the  loca¬ 
tion  of  the  dimple,  with  water  flowing  in  from  two  sides  and  flowing  out  in  the  perpendicular 
directions.  It  starts  up  as  soon  as  the  jet  is  switched  on,  and  ceases  when  the  air  source  is 
switched  off.  When  the  air  rate  is  turned  up,  waves  are  formed  at  the  surface  which  spiral 
out  from  the  cavity  (Figure  8a)  accompanying  the  E-mode.  At  higher  Reynolds  number 
the  surface  waves  were  more  axisymmetric  (Figure  8b)  and  increased  in  amplitude  with 
increasing  jet  Reynolds  number.  The  wave  formation  was  clearly  triggered  by  the  dimple 
oscillations,  but  a  quantitative  analysis  of  the  surface  phenomena  are  beyond  the  scope  of 
this  study. 

4  Data  analysis 

4.1  Geometric  properties  of  the  dimple 

The  cavity  depth  ( hc  in  Figure  1)  increases  with  increasing  jet  strength  and  decreases  with 
increasing  nozzle  height.  This  is  shown  in  Figure  9a,  where  the  symbols  correspond  to  the 
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(a) 


(b) 


Figure  8:  Snapshots  of  the  surface  wave  field  (a)  E-mode:  (  =  7.4,  Re  =  1075  (b)  Bubbling 
and  splashing:  (  =  14.8,  Re  =  2270 


cavity  depth,  averaged  over  all  frames  of  a  single  movie  with  the  corresponding  jet  strength 
Re  (x-axis)  and  nozzle  height  (j.  The  colour  coding  for  (  and  symbols  for  mode  of  instability 
are  the  same  as  used  in  Figure  6.  The  low-Reynolds  number  regime  is  enlarged  in  Figure  9b. 
The  maximum  cavity  depth,  which  is  a  more  objective  measure  for  oscillating  dimples,  is 
plotted  in  Figures  10a  and  10b.  (Note  that  cavity  depths  on  the  red  line  (£  =  37.0)  exceed 
the  values  on  the  yellow  line  (£  =  29.6)  in  the  low-Reynolds  number  regime.  This  is  not  a 
physical  feature  however,  but  a  result  of  increasing  difficulty  in  obtaining  sharp  images  as 
the  bottom  of  the  cavity  approached  the  meniscus.)  The  area  of  the  two-dimensional  image 
of  the  cavity  shows  a  similar  behaviour  (not  shown). 

The  depth  of  the  stable  cavities  slowly  increases  when  the  jet  strength  is  raised  (generally 
Re  <  1050).  Then  a  clear  difference  is  observed  between  the  three  smallest  nozzle  heights 
and  the  two  larger  ones.  In  the  former  case,  the  cavity  depth  raises  quickly  between  a 
Reynolds  number  of  approximately  1050  and  1200,  around  the  point  of  instability.  In  the 
latter  case  the  cavity  depth  only  increases  slightly  over  this  range  of  Reynolds  numbers.  This 
is  in  line  with  the  preliminary  conclusion  from  Figure  6  in  Section  3.1  that  the  behaviour  of 
the  dimple  is  different  for  the  larger  two  nozzle  heights:  the  turbulence  in  the  jet  is  already 
fully  developed  and  therefore  the  peak  velocity  is  lower  at  impact.  The  jet  is  therefore 
unable  to  exert  a  large  enough  pressure  on  the  water  surface  to  deepen  the  dimple  further. 

A  second  interesting  property  is  the  evolution  of  the  curvature  of  the  cavity  as  the  jet 
strength  is  increased.  In  Figure  11  the  maximum  curvature  is  given  for  the  lowest  three 
nozzle  heights  (the  jet  does  not  form  a  stable  cavity  over  a  range  of  Re  at  larger  nozzle 
heights  and  the  curvature  can  therefore  not  be  determined  well).  As  the  point  of  instability 
is  approached,  the  maximum  curvature  quickly  increases.  The  lines  suggest  that  instability 
sets  in  when  a  critical  value  of  the  curvature  is  exceeded,  around  11  •  1CP3.  Figure  10a 
suggests  that  a  similar  critical  value  of  the  cavity  depth  (or  jet  impact  velocity)  may  exist 
for  the  onset  of  the  bubbling  and  splashing  phase.  In  the  current  set  of  experiments  this 
critical  depth,  which  is  at  11  to  12  mm,  is  only  reached  in  the  smallest  (  case. 
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Figure  9:  Mean  cavity  depth  hc  as  a  function  of  the  jet  Reynolds  number.  The  colours  and 
symbols  are  as  in  Figure  6. 
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Figure  10:  Maximum  cavity  depth  hc  as  a  function  of  the  jet  Reynolds  number.  The  colours 
and  symbols  are  as  in  Figure  6. 
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Maximum  curvature 


Figure  11:  Maximum  curvature  of  the  cavity  outline  as  a  function  of  the  jet  Reynolds 
number  for  the  stable  cavities.  The  colours  and  symbols  are  as  in  Figure  6. 

4.2  Dimple  dynamics 

The  geometric  properties  of  the  dimple  were  described  above.  In  this  section  the  behaviour 
of  the  dimple  in  time  is  analyzed.  A  first  impression  of  the  differences  between  different 
modes  of  oscillation  is  presented  in  Figure  12.  The  cavity  depth  and  angle  with  respect 
to  the  vertical  1  are  drawn  for  the  E-mode  (left  panels),  the  V-mode  (middle  panels),  and 
the  B-mode  (right  panels)  at  nozzle  height  £  =  7.4.  The  E-mode  only  shows  very  slight 
oscillations  in  depth,  but  very  regular  oscillations  in  the  angle  time  series.  The  angle  in  the 
V-mode  on  the  other  hand  is,  as  expected,  almost  zero,  with  clear  oscillations  in  the  depth 
series.  The  B-mode  shows  oscillations  in  both  depth  and  angle.  As  the  behaviour  of  the 
dimple  in  the  B-mode  seemed  fairly  chaotic  at  first  sight,  the  oscillations  are  remarkably 
regular.  In  contrast  to  the  E-mode,  with  one  clear  frequency,  and  the  V-mode,  which  has  a 
more  intermittent  behaviour,  the  signal  shows  multiple  frequencies  with  a  clear  periodicity. 

The  frequency  spectra  of  the  three  modes  of  instability  (Figure  13)  show  spectral  peaks 
at  slightly  different  frequencies.  The  E-mode  has  a  peak  around  55  Hz  in  the  power  spectra 
of  cavity  depth  and  angle  with  the  vertical.  A  second  peak  is  found  in  the  depth  spectrum 
at  twice  the  55  Hz  frequency,  around  108  Hz,  that  is  larger  than  the  55  Hz  peak.  This 
higher  frequency  mirrors  the  double  crossing  of  the  vertical  in  the  apparent  pendulum¬ 
like  movement  of  the  dimple  in  the  2-dimensional  image  of  the  3-dimensional  elliptical 

1This  angle  is  defined  as  the  angle  between  the  axis  of  symmetry  (the  continuation  of  the  centerline  of 
the  jet)  and  the  line  drawn  from  the  crossing  of  the  centerline  of  the  jet  and  the  water  surface  to  the  tip  of 
the  dimple. 
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Figure  12:  Cavity  depth  (upper  panels)  and  angle  of  the  cavity  with  the  vertical  (lower 
panels)  from  time  series  in  the  E-mode  (left  panels),  the  V-mode  (middle  panels),  and  the 
B-mode  (right  panels)  at  nozzle  height  (  =  7.4. 


movement.  The  V-mode  spectrum  is  more  noisy,  but  shows  the  highest  peak  around  52 
Hz  in  the  depth  series,  comparable  to  the  E-mode.  As  expected,  no  distinct  peaks  are 
observed  in  the  angle  power  spectrum.  Last,  the  spectra  of  the  B-mode  time  series  show 
a  high  peak  at  low  frequency  (14.5  Hz  in  the  angle  spectrum  and  twice  this  value,  29  Hz, 
in  the  depth  spectrum)  with  lower  peaks  at  higher  harmonics.  One  of  the  peaks  is  at 
57  Hz,  again  comparable  to  the  55-Hz  peak  in  the  E-mode  spectra.  Although  the  exact 
values  of  the  frequency  peaks  slightly  differ,  we  not  believe  them  to  be  actually  different 
and  the  fundamental  oscillation  frequency  does  therefore  not  seem  to  change  as  the  dimple 
oscillation  enters  another  mode. 

Figure  14  shows  the  frequency  spectra  for  the  highest  Reynolds  number  considered  at 
the  different  nozzle  heights.  Unlike  the  spectra  at  one  nozzle  height,  the  spectral  peak 
shows  up  at  increasingly  lower  frequency  for  increasing  nozzle  height.  (An  exception  is 
in  the  (  =  29.6  line,  which  does  not  show  one  clear  peak  at  all  because  the  jet  is  going 
turbulent  around  this  nozzle  height.)  The  same  behaviour,  but  less  distinct,  is  observed  at 
the  other  high-Reynolds  number  measurements.  For  the  lower  Reynolds  number  regime,  no 
such  pattern  is  found. 

In  the  regime  diagram  (Figure  6)  a  separate  mode  was  assigned  to  the  seemingly  stable 
cavities  at  higher  nozzle  heights  which  showed  very  small  and  rapid  oscillations.  This 
shivering-like  behaviour  does  not  seem  to  be  an  oscillation  of  the  dimple  itself.  An  example 
of  a  time  series  in  this  mode  is  give  in  Figure  15.  The  larger  troughs  were  probably  dust 
particles  or  other  disturbances  swept  through  the  cavity  by  the  surface  flow.  The  smaller 
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Frequency  spectrum 


Figure  14:  Frequency  spectra  of  the  time  series  at  Re  =  3360.  The  line  colours  for  nozzle 
heights  are  as  in  Figure  6. 


noisy  oscillations  are  the  observed  small  and  rapid  oscillations.  The  frequency  spectrum 
of  the  middle  part  of  the  time  series  confirms  that  there  is  no  clear  spectral  peak  at  high 
frequency.  These  oscillations  are  thus  indeed  noise.  The  noise  is  likely  caused  by  the 
increased  turbulence  in  the  jet,  which  has  more  time  to  develop  at  the  larger  nozzle  height. 


Cavity  depth  time  series 


Time  [s] 

Power  spectrum  of  small  oscillations 


Figure  15:  Time  series  of  the  cavity  depth  and  its  frequency  spectrum  at  £  =  29.6  and 
Re  =  1030. 
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5  Discussion  and  Conclusions 


The  behaviour  of  a  dimple  that  is  formed  by  an  air  jet  impinging  at  right  angles  on  a  water 
surface  was  studied.  In  particular  the  dependence  of  the  geometric  properties,  onset  of  oscil¬ 
lation  and  modes  of  oscillation  were  studied  as  a  function  of  the  air  flow  rate  at  the  nozzle, 
defined  as  the  jet  Reynolds  number,  and  the  source  height,  given  by  the  nondimensional 
nozzle  height. 

At  low  jet  Reynolds  number  a  stable  cavity  is  formed.  When  the  air  flow  rate  is  turned 
up,  a  sequence  of  oscillation  modes  is  observed.  The  range  over  which  the  modes  of  oscilla¬ 
tions  are  found  depends  on  the  nozzle  height.  Also  not  all  unstable  modes  are  observed  at 
every  nozzle  height  considered.  The  unstable  modes  identified  were: 

•  E-mode:  The  tip  of  dimple  traces  out  an  ellipse.  This  mode  was  only  found  at  the 
smallest  nozzle  height  and  shows  very  regular  oscillations. 

•  V-mode:  The  dimple  oscillates  vertically.  This  mode  was  found  at  all  nozzle  heights 
except  for  the  fourth  one.  The  regime  diagram  however  indicates  that  the  Re-range 
where  this  mode  is  expected  at  the  fourth  nozzle  height  is  not  sampled  in  this  study. 
The  oscillations  in  this  mode  are  sometimes  rather  regular,  but  other  times  they 
showed  more  intermittent  behaviour  (see  e.g.  the  upper  middle  panel  in  Figure  12). 

•  B-mode:  The  oscillation  overshoots,  and  the  tip  of  the  dimple  bends  and  rolls  up  the 
side  of  the  dimple.  This  mode  was  observed  at  the  lowest  two  nozzle  heights.  Again, 
the  Re-range  at  which  this  mode  is  expected  at  the  higher  nozzle  heights  was  not 
sampled.  The  oscillations  in  this  mode  are  again  very  regular. 

•  At  the  largest  nozzle  heights,  very  small  oscillations  of  the  dimple  are  observed  at  low 
jet  Reynolds  number,  which  shows  up  as  noise  in  the  time  series.  We  speculate  this 
is  due  to  an  instability  of  the  jet. 

•  In  the  high  Reynolds  number  range  two  other  modes  are  found: 

—  Bubbling  and  splashing.  This  mode  is  only  observed  at  the  lowest  nozzle  height, 
but  will  probably  eventually  set  in  at  higher  nozzle  heights  as  well  if  the  flow  rate 
is  increased  further.  It  seems  that  a  critical  dimple  depth  exists  above  which  this 
mode  sets  in. 

—  A  widening  of  the  dimple.  We  believe  this  shape  to  arise  when  the  jet  overcomes 
the  surface  tension.  This  mode  is  not  observed  at  the  lowest  nozzle  height. 

Two  distinctively  different  regimes  are  found.  For  the  smallest  three  nozzle  heights,  the 
Reynolds  number  at  which  a  next  mode  of  oscillation  is  entered  increases  with  increasing 
nozzle  height.  For  the  largest  two  nozzle  heights  on  the  other  hand  the  required  Reynolds 
number  for  a  certain  mode  of  oscillation  decreases  with  increasing  nozzle  height.  Also,  the 
cavity  depth  increases  rapidly  around  the  point  of  instability  for  the  smallest  three  nozzle 
heights,  while  it  only  increases  slightly  in  the  larger  two  cases.  Overall,  the  depth  of  the 
cavity  increased  with  increasing  jet  strength  and  decreases  with  increasing  nozzle  height. 
The  difference  in  cavity  depth  with  nozzle  height  is  small  at  low  jet  Reynolds  number  and 
larger  at  high  jet  Reynolds  number. 
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Clear  evidence  is  found  for  two  competing  effects  in  the  system  as  the  air  flow  rate 
in  the  jet  is  increased.  If  both  the  nozzle  height  and  the  flow  rate  are  small,  the  dimple 
depth  increases  with  increasing  flow  rate  until  the  dimple  becomes  unstable.  If  the  nozzle 
is  sufficiently  far  away  from  the  water  surface  however,  turbulence  develops  in  the  jet  when 
the  flow  rate  is  high  enough.  In  a  turbulent  jet  the  velocity  decreases  with  distance  from 
the  nozzle  and  the  rate  of  increase  of  impact  of  the  jet  therefore  levels  off. 

The  time  dependent  behaviour  of  the  dimple  oscillations  was  studied  using  their  fre¬ 
quency  spectra.  The  spectra  suggest  that  the  frequency  does  not  change  much  when  the 
nozzle  height  is  fixed  (spectral  peaks  at  different  higher  harmonics  do  show  up  with  in¬ 
creasing  Reynolds  number).  For  the  smallest  nozzle  height  a  fundamental  frequency  around 
55  Hz  is  found.  This  frequency  can  be  linked  to  the  waves  produced  on  the  surface.  The 
dispersion  relation  for  deep  water  gravity  waves  is  ui  =  \fgk.  With  c o  =  ck,  c  =  A/T,  and 
k  =  2n/\,  the  frequency  of  the  wave  is  found  from 


where  u>  is  the  angular  frequency,  c  the  phase  velocity,  k  the  wave  number,  T  the  wave 
period,  and  A  the  wave  length.  If  the  dimple  represents  half  a  wave,  the  wave  length  is 
twice  the  dimple  width.  The  small  dimples  are  about  1  mm  wide,  giving  a  frequency  of  28 
Hz.  The  fundamental  frequency  of  55  Hz  is  twice  this  frequency.  At  the  higher  Reynolds 
numbers  the  largest  peak  in  the  spectrum  shifts  to  a  lower  frequency  with  increasing  nozzle 
height.  It  should  be  noted  that  not  all  spectra  show  clear  narrow  peaks.  As  the  filter 
removes  part  of  the  time  series,  perhaps  longer  time  series  would  give  a  better  result.  Also 
ensemble  measurements  with  more  than  one  time  series  per  Re  —  (  pair  could  increase  the 
signal-to-noise  ratio,  especially  in  the  modes  where  the  dimple  oscillations  are  irregular. 

A  few  open  question  remain  after  this  study.  In  particular,  it  would  be  very  helpful 
to  know  more  about  the  air  jet  itself.  Visualization  of  the  jet  would  provide  information 
on  the  formation  of  turbulence,  the  radius  at  impact  and  its  relation  to  the  dimple  radius. 
Without  specific  information  on  the  jet,  it  is  very  difficult  to  form  an  idea  of  the  physics 
involved,  and  results  remain  speculative. 

Secondly,  surface  tension  seem  to  be  an  important  factor  in  the  dimple  stability.  This 
was  already  confirmed  by  earlier  studies  (in  particular  Rosier  and  Steward  (8)).  If  the  radius 
of  the  jet  could  however  be  measured  accurately  in  this  study,  the  Bond  and  Weber  number 
can  be  calculated  and  related  to  the  jet  Reynolds  number  and  the  nozzle  height  (the  jet 
spreads  and  thus  reaches  a  larger  radius  at  impact  at  larger  nozzle  height).  This  is  likely 
to  shed  a  more  physical  light  on  the  now  speculative  lines  in  the  regime  diagram. 

Thirdly,  measurements  should  be  added  in  the  regime  diagram  in  the  700  to  1500 
Reynolds  number  range,  in  particular  where  the  speculative  lines  suggest  the  presence  of  a 
certain  mode  at  a  certain  nozzle  height  which  is  not  yet  confirmed  due  lack  of  data. 
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Abstract 

We  study  a  population  model  in  which  there  are  two  species,  one  of  which  has  a 
juvenile  and  adult  life  stage.  The  adults  of  the  first  species  prey  on  the  second  species, 
which  in  turn  preys  on  the  juveniles  of  the  first.  One  version  of  the  model  represents 
systems  where  neither  species  can  survive  on  its  own,  although  we  find  that  both  can 
survive  through  mutual  predation.  To  avoid  extinction,  the  two  types  of  predation  must 
be  of  sufficient  strength  and  in  appropriate  proportion  to  one  another.  Another  version 
of  the  model  represents  systems  where  each  species  can  survive  without  the  other,  and 
there  we  find  that  mutual  predation  is  capable  of  increasing  both  of  their  equilibrium 
populations  or  creating  stable  limit  cycles. 

1  Introduction 

An  organism’s  trophic  level  is  the  position  that  it  occupies  in  the  food  web,  defined 
by  the  organisms  that  it  eats  and  vice  versa.  However,  few  organisms  occupy  the  same 
trophic  levels  throughout  their  lives.  Typically  an  organism’s  predators  and  prey  change 
over  its  lifetime,  as  do  the  organisms  of  similar  trophic  level  with  which  it  competes  for 
food.  Werner  and  Gilliam  [2]  review  many  examples  of  ecosystems  in  which  competitive 
and  predatory  relationships  are  age-  and  size-dependent.  As  an  instance  of  the  latter, 
adult  salamanders  and  newts  prey  on  one  another’s  juveniles.  Similarly,  frogs  eat  insects, 
while  insect  larva  eat  tadpoles.  Although  competition  is  important,  we  do  not  consider  it 
in  this  report,  where  our  object  is  to  explore  solely  the  dynamics  that  arise  in  ecological 
models  with  age-structured  predation. 


2  Development  of  the  models 

Any  model  for  age-structured  predation  must  have  at  least  two  species,  and  at  least 
one  of  the  species  must  have  multiple  life  stages.  Some  ecological  models  have  used 
continuous  age  or  size  variables  to  describe  life  stage,  which  requires  partial  differential 
equations  to  describe  the  change  of  populations  in  time,  but  the  simplest  possibility  is 
to  have  two  discrete  life  stages.  Thus,  the  simplest  possible  model  has  three  populations 
in  total:  one  species  with  one  life  stage  and  one  species  with  both  juvenile  and  adult 
life  stages.  For  concreteness  we  shall  call  our  populations  tadpoles  (T),  frogs  (F),  and 
insects  (/),  where  the  frogs  eat  the  insects,  the  insects  eat  the  tadpoles,  and  the  tadpoles 
and  frogs  beget  one  another  through  recruitment  and  reproduction.  For  simplicity  we 
shall  assume  that  recruitment  occurs  at  a  rate  proportional  to  tadpole  biomass,  that 
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reproduction  occurs  at  a  rate  proportional  to  frog  biomass,  and  that  a  constant  fraction 
of  the  biomass  being  transferred  through  predation  is  lost  to  metabolic  inefficiencies. 
The  model  as  described  thus  far  is  diagramed  in  Figure  1.  Through  the  “feeding/death” 
fluxes  in  Figure  1,  each  population  may  gain  biomass  by  feeding  on  the  external  envi¬ 
ronment  (which  excludes  the  other  two  populations  explicitly  modeled) ,  or  lose  biomass 
through  death.  All  that  remains  to  fully  define  our  population  model  is  to  specify  the 
functional  forms  of  the  predation  and  feeding/death  terms. 


Figure  1:  Biomass  flux  in  the  tadpole-frog-insect  system. 


2.1  Feeding  or  death  terms 

Assuming  the  tadpoles,  frogs,  and  insects  do  not  compete  with  one  another,  each  pop¬ 
ulation’s  feeding/death  term  should  be  independent  of  the  other  two  populations.  The 
simplest  choice  is  a  proportional  law,  which  in  the  absence  of  predation  creates  expo¬ 
nential  growth  or  decay,  depending  on  the  parameter  values.  Unbounded  exponential 
growth  is  unrealistic,  but  exponential  decay  is  feasible:  it  represents  a  case  where  the 
two  species  cannot  survive  without  predation,  and  we  analyze  such  a  model  in  Section  3. 
We  also  wish  to  consider  a  scenario  in  which  both  species  would  survive  without  the 
predation.  This  requires  that  the  system  have  carrying  capacities,  which  cause  initially 
exponential  growth  to  saturate  at  finite  values.  We  must  put  a  carrying  capacity  on  the 
insects,  but  we  have  a  choice  between  putting  one  on  the  frogs,  the  tadpoles,  or  both. 
We  shall  somewhat  arbitrarily  place  a  carrying  capacity  on  frogs  and  not  on  tadpoles. 
In  the  biological  reality  of  frogs  and  tadpoles  this  is  usually  accurate;  frogs  will  typically 
run  out  of  environmental  resources  before  tadpoles  do.  But  for  other  species,  resources 
may  certainly  be  scarcer  for  juveniles  than  for  adults.  We  analyze  the  carrying  capacity 
model  in  Section  4. 
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2.2  Predation  terms 


The  simplest  possible  predation  law  is  a  quadratic  one  in  which  the  rate  of  predation 
is  proportional  to  both  predator  and  prey  populations,  the  so-called  Holling  type  I 
functional  form  [1],  Some  models  of  age-structured  predation  employing  these  forms 
were  studied  by  Whitehead  and  Doering  [3],  but  such  a  form  fails  to  reflect  the  fact 
that  increasing  the  amount  of  prey  beyond  a  certain  point  stops  benefiting  the  predators 
because  there  is  a  limit  to  how  fast  they  can  eat  and  metabolize  the  prey.  The  Holling 
type  II  functional  form  avoids  this  problem  by  saturating  as  the  amount  of  prey  becomes 
large: 

„  ,  .  (prey  population)  (predator  population) 

rate  of  predation  =  C-i - - - - - —— - - — ; — - , 

1  +  C 2 (prey  population) 

where  C\  and  C2  are  constant  parameter.  We  have  examined  some  models  employing  the 
type  I  form,  but  it  leads  to  unbounded  growth  in  certain  cases,  so  we  shall  henceforth 
always  use  the  type  II  form.  It  is  not  hard  to  justify  needing  a  predation  law  with 
saturation,  but  it  is  certainly  not  clear  a  priori  that  the  type  II  form  is  the  best  choice. 
We  justify  this  choice  in  the  appendix,  where  we  derive  the  type  II  form  as  an  asymptotic 
limit  of  a  higher-dimensional  dynamical  system  and  numerically  compare  the  reduced 
system  with  the  higher-dimensional  system. 


3  Model  without  carrying  capacities 

Let  the  feeding/death  term  for  tadpoles  add  biomass  at  the  rate  exT,  where  ex  can  take 
either  sign,  and  likewise  for  frogs  and  insects.  Then,  the  dimensional  ODE  governing 
the  biomass  of  each  population  is 


IT 

T=-7T  +  /3F-k— - — 

1  +  l-iT 

(1) 

F  =  aT  (F  +  pp  A 

1  +  vl 

(2) 

■  IT  FI 

/  =  e,/  +  „«1  +  (i  T  \l  +  yV 

(3) 

where 


7  =  a  —  ex 
C  =  P  -  £f- 

Without  frogs  or  insects,  tadpoles  should  die  out,  and  likewise  for  frogs,  so  7  and  (  must 
be  positive.  Only  the  ej  parameter  may  be  negative.  If  a  <  7,  tadpole  feeding  is  adding 
biomass  to  the  system,  and  the  same  is  true  for  frogs  if  (3  <  (,.  The  parameters  ?y/  and  r/p 
are  metabolic  efficiencies,  so  they  must  fall  between  0  and  1.  We  now  nondimensionalize 
the  system  by 

t^H  TmIt  F 1— >  -3-F  (4) 

This  particular  nondimensionalization  is  attractive  because  it  retains  unique  coefficients 
on  all  the  nonlinear  predation  terms  and  minimizes  the  number  of  free  parameters 
(seven).  Its  drawback  is  that  the  different  populations  are  no  longer  in  equivalent  units 
of  biomass,  so  they  cannot  be  meaningfully  compared.  This  is  tolerable  because  we  are 
more  interested  in  qualitative  system  behavior  than  relative  population  biomasses.  The 
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nondimensional  ODE  is 
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3.1  Model  without  predation 

Without  predation,  the  model  is  linear,  and  the  T-F  system  decouples  from  the  insects: 


dL 

dt 


1  0\  /at\ 

-6  0  AF 
0  c/  \A// 


The  tadpoles  and  frogs  decay  exponentially  if  a  <  b  and  grow  exponentially  if  a  >  b, 
while  the  insects  decay  exponentially  if  c  <  0  and  grow  exponentially  if  c  >  0.  The 
behaviors  of  the  uncoupled  systems  split  the  full  system  rather  naturally  into  four 
separate  cases.  (Structurally  unstable  parameter  values  such  as  a  =  b  or  c  =  0  will 
not  be  considered  in  anything  that  follows.)  The  only  equilibrium  of  this  system  is  the 
origin,  and  the  decoupled  systems  are  rather  boring  without  predation.  Predation  can 
create  stable  fixed  points  and  limit  cycles  for  all  four  combinations  of  signs  of  (a  —  b) 
and  c,  but  only  the  case  in  which  the  T  —  F  and  I  systems  both  decay  without  predation 
is  biologically  reasonable,  so  we  shall  restrict  ourselves  to  this  case.  Henceforth  in  this 
section,  a  <  b  and  c  <  0. 

When  /  and  the  T-F  system  both  decay  without  predation,  the  mechanism  by 
which  predation  can  stabilize  the  system  is  the  following.  Suppose  that  biomass  enters 
the  system  though  frog  feeding  ((  <  (3)  and  leaves  the  system  through  tadpole  death 
(7  >  a),  with  tadpole  death  dominating  when  the  T-F  system  is  isolated.  In  other 
words,  the  rate  of  reproduction  is  higher  than  ideal.  Introducing  insects  creates  a  flux 
of  biomass  from  tadpoles  to  frogs,  where  it  is  used  to  create  more  biomass  from  external 
feeding,  rather  than  lost  to  tadpole  death.  This  effect  can  prevent  the  entire  system 
from  decaying  to  zero,  even  with  the  additional  sources  of  biomass  loss  by  insect  death 
and  metabolic  inefficiencies. 


3.2  Lyapunov  bound 

Not  only  is  the  origin  linearly  stable  when  a  <  b  and  c  <  0,  the  Lyapunov  functional 
L  =  |F+^F  +  /  suffices  to  show  that  all  solutions  decay  to  the  origin.  This  is  a  rather 
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narrow  range  of  validity,  but  it  suggests  the  possible  importance  of  the  ratio  In 
terms  of  dimensional  variables, 

eg  f3 


"77  =  VIV  F~- 
df  7 

Biologically,  gigF  is  the  squared  geometric  mean  of  the  metabolic  efficiencies,  so  we  can 
think  of  this  term  roughly  as  the  metabolic  efficiency  of  the  entire  system. 


3.3  Nontrivial  equilibria 


At  nontrivial  fixed  points, 


F  =  T 


dl 

1  +  T 


aT  =  F 


0  =  c  + 


eT 

1  +  T 


fF 
1  +  /' 


(8) 

(9) 

(10) 


From  Equation  (8),  I  and  T  uniquely  define  F  at  a  fixed  point.  Using  this  equation  to 
eliminate  F  from  the  latter  two  yields  two  polynomials  equations  in  /  and  T: 


o(l  +  I)(l  +  T)  =  (1  +  T  +  dl){b  +bl-  gl)  (11) 

c(l  +  I)  (1  +  T)  +  eT(l  +  J)  =  /T(l  +  T  +  dl) .  (12) 

Equation  (11)  is  linear  in  T,  and  Equation  (12)  is  linear  in  /,  so  they  may  be  used  to 
find  explicit  expressions  for  T(T)  and  /(T),  respectively: 

T_  dl(b+(b-g)l)  1 
(a  +  g  —  b)I  +  a  —  b 
7_  fT2  —  (c  +  e  —  f)T  —  c . 

(c  +  e  —  df)T  +  c 

Evidently,  /  and  T  define  one  another  uniquely.  Applying  T{I)  to  Equation  12  gives 
a  cubic  polynomial  for  /  at  the  equilibria,  so  there  are  at  most  three  positive  real 
population  equilibira. 

i3I 3  +  i2I2  +  ill  +  io  =  0,  where 


10  =  —  e(a  —  b)2  <  0 

11  =  (a  —  b)  [2  e(b  —  g)  +  bd(c  +  e)  +  a(df  —  2e)] 

%2  =  d(c  +  e)  [(a  —  b)(b  —  g)  +  b{a  +  g  —  b)]  —  e(a  +  g  —  b)2  +  adf{a  +  g  —  b  —  bd ) 
i3  =  d(b  -  g)[(c  +  e)(a  +  g  -  b)  -  adf } 

The  cubic  equation  for  I  of  course  has  explicit  algebraic  solutions,  but  they  are  too 
messy  to  be  of  use,  so  although  we  have  explicit  expressions  for  F  and  T  in  terms  of 
/,  we  cannot  generally  predict  whether  F  and  T  will  be  positive  when  I  is  positive. 
The  best  we  can  do  analytically  is  infer  some  partial  information  about  the  signs  of 
polynomial  roots.  For  this,  we  also  need  the  first  and  last  coefficients  of  the  cubic 
polynomial  governing  T,  which  we  derive  by  applying  7(T)  to  Equation  11: 

t3T3  +  t2T2  +  t\T  +  T  =  0,  where 


T  =  —c2g  <  0 

h  =  f[(c+  e)(a  +  g  -  b)  -  adf}. 
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Theorem  Let  a  <b  and  c  <  0.  If  g  <  b  and  adf  <  (c  +  e)(a  +  g  —  b),  there  are  either 
1  or  3  positive  equilibria.  Otherwise,  there  are  either  0  or  2  positive  equilibria. 

Proof  The  I  polynomial  is  cubic,  and  I  uniquely  defines  F  and  T,  so  there  are  at 
most  three  real,  positive  equilibria.  It  is  clear  from  Equation  8  that  F  is  positive  when 
I  and  T  are,  so  it  suffices  to  know  when  the  I  and  T  polynomials  have  corresponding 
positive  roots.  There  are  three  ways  in  which  the  number  of  positive  roots  of  the  I  or  T 
polynomials  may  change.  Firstly,  a  root  may  remain  real  but  leave  the  positive  octant 
if  ?'o  or  to  becomes  negative,  but  we  have  eliminated  this  possibility  by  assumption. 
Secondly,  the  number  of  positive  equilibria  may  change  by  two  when  a  pair  of  equilibria 
become  complex  simultaneously,  a  saddle-node  bifurcation.  Thirdly,  an  equilibrium 
may  move  off  to  infinity  when  i3  or  f3  pass  through  zero.  By  dividing  parameter  space 
into  four  regions  in  which  i3  and  to  do  not  change  sign,  we  are  assured  that  the  number 
of  equilibria  within  each  region  may  change  only  by  saddle-node  bifurcations.  These 
regions  are 


I  =  {i 3  >  0,  to  >  0} 
II  =  {?3  <0,  f3  >  0} 

III  =  {*3  >0,  to  <  0} 

IV  =  {*3  <  0,  t3  <  0} 


{g  <  6,  adf  <  (c  +  e)(a  +  g  -  b)} 

{g  >  6,  adf  <  (c  +  e)(a  +  g  -  b)} 

{g  <  6,  adf  >  (c+  e)(a  +  g  -  b)} 

{g  >  6,  adf  >  (c+  e)(a  +  g  -  b)}. 


If  we  know  the  number  of  positive  equilibria  at  one  point  in  each  parameter  region,  we 
know  the  number  of  positive  equilibria  at  all  points  in  that  parameter  region,  modulo 
2.  So,  we  chose  one  such  test  point  in  each  parameter  region  and  computed  the  equi¬ 
libria  numerically,  inferring  from  this  the  possible  number  of  positive  equilibria  in  each 
parameter  region.  The  results  are  tabulated  in  Table  1.  Evidently,  there  may  be  1  or 
3  positive  equilibria  in  parameter  region  I,  and  0  or  2  otherwise.  Parameter  region  I 
is  defined  precisely  by  the  condition  that  g  <  b  and  adf  <  (c  +  e)  (a  +  g  —  b) ,  so  the 
theorem  is  proved.  I 


Table  1:  Number  of  nontrivial  equilibria  in  each  of  the  four  parameter  regions,  as  inferred 
from  a  test  points,  (a,  b,  c,  d,  e,  /,  g),  in  parameter  space. 


Parameter  region 

Test  point 

Number  of  positive 
equilibria  at  test  point 

Number  of  positive 
equilibria  in  region 

I 

(0.9,1, -0.5, 1,1.5, 0.1, 0.5) 

1 

1  or  3 

II 

(0.9,1, -0.5, 1,1.5, 0.1, 2) 

2 

0  or  2 

III 

(0.9,1,-0.5,100,1.5,0.1,0.5) 

0 

0  or  2 

IV 

(0.9,1,-0.5,100,1.5,0.1,2) 

0 

0  or  2 

By  the  preceding  theorem,  we  have  partial  information  about  the  number  of  equilib¬ 
ria  inside  and  outside  region  I.  Because  a  saddle-node  bifurcation  creates  two  equilibria 
of  opposite  stability,  we  also  know  that  there  can  be  at  most  two  stable  equilibria  inside 
parameter  region  I,  and  at  most  one  stable  equilibrium  outside  it.  We  would  of  course 
like  precise  conditions  on  when  the  saddle-node  bifurcations  occur,  thereby  further  di¬ 
viding  our  parameter  regions  into  ones  in  which  the  numbers  of  positive  equilibria  are 
exactly  known.  This  requires  knowing  when  I  or  T  become  complex.  Since  I  and  T  are 
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governed  by  cubic  equations,  this  can  in  principle  be  determined  from  the  discriminants 
of  those  equations.  The  I  equation  has  one  real  and  two  complex  roots  precisely  when 

A/  =  18*3*2*i*o  —  4»2?o  +  i\i\  —  4 *3*f  —  27*3*q  <  0, 

and  likewise  for  the  coefficients  of  the  T  equation,  so  the  parameter  regimes  we  seek 
are  divided  by  the  surfaces  on  which  the  discriminants  vanish.  Unfortunately,  these 
inequalities  are  prohibitively  messy  when  expressed  in  terms  of  the  problem  parameters. 
In  biological  modeling,  one  is  more  interested  in  qualitative  behaviors  than  the  precise 
values  at  which  bifurcations  occur,  so  we  will  be  content  to  observe  the  saddle-node 
bifurcations  numerically. 

3.4  Numerical  exploration  of  parameter  space 

Asymptotic  analysis  shows  that  the  predation  parameters  cannot  be  small  compared  to 
the  population  decay  rates  if  predation  is  to  stabilize  the  decay.  In  fact,  numerical  exper¬ 
iments  reveal  that  the  predation  parameters  must  typically  be  about  an  order  of  mag¬ 
nitude  larger  than  the  decay  rates.  In  light  of  this,  we  shall  fix  (a,b,c)  =  (0.9,1,— 0.1) 
and  expect  interesting  behavior  when  predation  parameters  are  0(1). 

3.4.1  Looking  for  bifurcations 

When  ( d,e,f,g )  =  (0.2, 1,  0.2, 0.5),  there  are  three  positive  equilibria,  the  maximum 
number  possible,  so  we  choose  this  as  a  starting  point  from  which  to  explore  the  four¬ 
dimensional  space  of  predation  parameters.  Linearizing  the  system  about  these  equilib¬ 
ria  and  computing  matrix  eigenvalues  numerically,  we  find  that  none  of  the  fixed  points 
are  stable.  Biologically,  we  are  interested  in  regimes  where  there  are  stable  fixed  points, 
so  we  use  the  bifurcation  continuation  package  MATCONT  to  continue  these  equilibria 
in  parameter  space.  Arbitrarily  choosing  the  parameter  e  in  which  to  continue  the  equi¬ 
libria,  we  obtain  the  bifurcation  diagram  of  Figure  2.  We  have  chosen  the  coordinate  I 
for  the  ordinate  of  our  bifurcation  diagrams  because  the  value  of  /  sometimes  becomes 
unrealistically  small,  and  we  wish  to  see  when  this  is  so. 

Continuing  in  e,  we  find  a  saddle-node  bifurcation  ( LP ),  a  subcritical  Hopf  bifurca¬ 
tion  ( H+ ),  and  a  neutral  saddle  (NS).  These  are  all  bifurcations  of  codimension  1,  as 
we  would  expect  to  find  when  varying  only  one  parameter.  To  access  the  higher-level 
structure  of  parameter  space,  we  would  like  to  find  higher-codimension  bifurcations. 
The  MATCONT  package  can  in  general  only  find  bifurcations  up  to  codimension-2,  so 
we  shall  settle  for  this,  though  ideally  we  would  like  to  find  bifurcations  of  codimension 
up  to  the  dimension  of  our  parameter  space.  Codimension-2  bifurcations  are  typically 
found  by  continuing  a  codimension- 1  bifurcation  in  two  parameters,  so  we  shall  continue 
all  of  the  bifurcations  of  Figure  2.  Continuing  in  e  and  g  yields  the  bifurcation  diagram 
of  Figure  3,  in  which  we  see  two  types  of  codimension-2  bifurcations:  Bogdanov-Takens 
bifurcations  ( BT ),  and  generalized  Hopf  bifurcations  (GH),  also  known  as  Bautin  bifur¬ 
cations.  Continuing  the  codimension- 1  bifurcations  of  Figure  2  in  any  other  combination 
of  two  parameters  does  not  yield  any  other  types  of  codimension-2  bifurcations. 

In  the  neighborhood  of  a  Bogdanov-Takens  bifurcation,  there  are  guaranteed  to  be 
a  saddle-node  bifurcation  and  a  Hopf  bifurcation  (both  of  which  we  have  seen  already), 
and  also  a  saddle  homoclinic  bifurcation.  In  the  neighborhood  of  a  generalized  Hopf 
bifurcation,  there  are  guaranteed  to  be  both  supercritical-  and  subcritical  Hopf  bifur¬ 
cations,  and  a  fold  bifurcation  of  limit  cycles.  Although  we  are  only  assured  of  these 
system  behaviors  in  local  neighborhoods  of  the  codimension-2  bifurcations,  we  can  rea¬ 
sonably  expect  to  see  them  all  around  parameter  space. 
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Figure  2:  Continuation  in  e  of  the  three  starting  equilibria. 


Figure  3:  Continuation  in  e  and  g  of  the  Hopf  bifurcation  of  Figure  2. 
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Before  exploring  the  effects  of  each  parameter,  we  would  like  to  find  a  baseline  point 
in  parameter  space  at  which  all  parameters  are  of  similar  magnitude,  and  at  which  a 
stable  fixed  point  exists.  There  is  always  a  stable  fixed  point  near  a  Hopf  bifurcation, 
so  we  continue  the  Hopf  bifurcation  of  Figure  2  in  various  pairings  of  the  predation 
parameters,  arriving  at  a  point  where  (d,e,f,g)  is  near  (2, 4, 2,  2).  We  shall  use  this 
and  other  nearby  values  as  our  baseline  parameter  points. 

3.4.2  Overall  predation  strength 

To  study  the  effect  of  the  overall  predation  strength  in  the  system,  we  fix  the  ratios 
between  all  four  predation  parameters  and  vary  them  proportionally.  The  resulting 
diagrams  are  shown  in  Figure  4,  where  we  have  used  two  sightly  different  parameter 
ratios  to  exhibit  one  case  each  where  the  Hopf  bifurcation  is  supercritical  or  subcritical. 
In  any  of  the  bifurcation  diagrams  that  follow,  the  predation  parameter  ratios  could 
be  chosen  to  realize  either  type  of  Hopf  bifurcation,  but  the  distinction  is  not  very 
important  because  limit  cycles  in  this  system  are  not  robust  biologically,  for  the  following 
reason.  In  the  chosen  regime  of  a,  b  and  c,  the  coordinates  of  the  stable  fixed  point 
appear  to  always  be  such  that  I  <C  T,F.  As  the  limit  cycle  emerging  from  a  Hopf 
bifurcation  grows,  the  oscillations  become  strongly  nonlinear,  consisting  of  a  fast  part 
and  slow  part.  During  the  slow  part  of  the  cycle,  the  value  of  I  is  very  small.  Without 
traveling  very  far  in  parameter  space,  either  the  limit  cycle  is  destroyed  in  a  homoclinic 
bifurcation,  or  /  becomes  so  small  during  the  slow  part  of  the  cycle  that  in  the  non¬ 
continuum  of  reality,  the  insect  population  would  go  extinct,  bringing  the  same  fate 
to  the  frogs  and  tadpoles.  If  we  altered  the  model  to  include  stochasticity  or  discrete 
populations,  we  could  reasonably  expect  it  to  display  this  biologically-realistic  extinction 
during  such  troughs  of  I.  The  type  of  Hopf  bifurcation  controls  whether  or  not  a  stable 
limit  cycle  coexists  with  a  stable  fixed  point,  but  since  the  limit  cycle  is  a  much  less 
robust  structure  than  the  fixed  point,  the  distinction  is  not  very  important. 

It  is  evident  from  Figure  4  that  the  system  has  no  stable  fixed  points  or  limit 
cycles  when  the  predation  is  too  weak.  This  is  expected,  since  the  predation  must 
stabilize  the  exponential  decays  of  the  uncoupled  systems,  and  it  makes  clear  that 
predation  cannot  do  this  as  a  mere  perturbation  on  the  uncoupled  systems.  As  predation 
increases,  the  fixed  point  not  only  remains  stable,  but  its  basin  of  attraction  enlarges. 
Simultaneously,  however,  the  equilibrium  insect  population  decreases  toward  zero,  and 
the  frog  and  tadpole  populations  decrease  asymptotically  toward  identical  finite  values. 
When  the  basin  of  attraction  is  too  small,  a  perturbation  could  send  the  system  across 
the  separatrix  and  onto  a  trajectory  heading  toward  the  origin  (extinction).  When 
the  equilibrium  insect  population  is  too  small,  it  is  vulnerable  to  eradication  by  some 
catastrophic  event,  in  which  fate  the  frogs  and  tadpoles  would  follow.  The  fitness  of  the 
insect  population  would  be  maximized  at  some  intermediate  predation  strength  that 
balances  these  two  factors. 

3.4.3  Relative  strengths  of  the  two  types  of  predation 

The  parameters  d  and  e  convey  the  strength  of  the  insect-tadpole  predation,  and  the 
parameters  /  and  g  do  likewise  for  the  frog-insect  predation.  To  study  the  effect  of  the 
relative  strengths  of  the  two  types  of  predation,  we  vary  the  insect-tadpole  predation 
parameters  in  fixed  proportion  while  holding  the  other  two  constant,  and  likewise  for  the 
frog-insect  parameters.  The  resulting  bifurcation  diagrams  are  presented  in  Figure  5, 
and  they  both  suggest  the  same  conclusions.  The  process  in  which  insects  move  biomass 
from  tadpoles  to  frogs  is  essential  to  stabilizing  the  system,  and  when  I-T  predation 
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Figure  4:  Bifurcation  diagrams  in  which  all  four  predation  parameters  are  varied  propor¬ 
tionally  to  study  the  overall  effect  or  predation  strength.  Different  proportions  between  the 
parameters  can  produce  either  supercritical  (top)  or  subcritical  (bottom)  Hopf  bifurcations. 
Light  gray  depicts  unstable  limit  cycles,  while  dark  gray  depicts  stable  limit  cycles. 
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is  too  weak  relative  to  F-I  predation,  the  insect  population  is  kept  low  and  cannot 
transfer  enough  biomass,  so  the  stable  fixed  point  disappears  through  a  saddle-node 
bifurcation.  Conversely,  when  the  I-T  predation  is  relatively  strong,  the  fixed  point 
will  lose  stability  to  stable  oscillations.  In  such  oscillations,  insects  eat  tadpoles  quickly 
and  deplete  the  tadpole  population,  after  which  frogs  eat  the  plentiful  insects,  depleting 
that  population,  and  then  frogs  give  birth  to  many  tadpoles,  losing  biomass  themselves 
until  the  insects  rebound.  When  these  oscillations  become  too  dramatic,  extinction  is 
the  likely  result. 

4  Model  with  carrying  capacities 

With  carrying  capacities  on  frogs  and  insects,  the  dimensional  governing  equations 
become 

T  =  -7  T  +  f3F  —  k 

1  +  /iT 

FI 

F=aT-(F(l+jrF)+VF\YT-i 

IT  FI 

1  =  'U(1  -  ip)  +  ""‘nyr  ~  XT+Vr 

4.1  Nontrivial  equilibria 

Without  predation,  a  nontrivial  equilibrium  exists  only  if  ^  >  1,  which  is  also  the 
condition  under  which  the  origin  is  unstable.  We  wish  to  study  a  parameter  regime  in 
which  the  system  has  nontrivial  behavior  before  predation  is  added,  so  we  shall  assume 
this  inequality  always  holds.  With  predation,  a  nontrivial  fixed  point  must  satisfy 

F=1Fb+^) 

These  algebraic  equations  cannot  be  solved  explicitly  for  the  equilibrium  populations, 
so  we  must  resort  to  perturbation  expansions  and  numerical  solutions. 

4.1.1  Perturbation  of  the  nontrivial  equilibrium  by  weak  predation 

Let  the  predation  strength  be  small: 

k  =  £K\  A  =  eAi, 

where  eCl.  We  can  expand  the  equilibrium  populations  in  e  (e.g.  T  ~  To+eTi+0(e2)), 
going  to  first  order  to  obtain  the  leading  order  impact  of  predation.  At  zeroth  order, 
i.e.  without  predation, 


T0=  ?(s£-l)JV 

(13) 

*b=($  ~l)N 

(14) 

h  =  M. 

(15) 
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Figure  5:  Bifurcation  diagrams  in  which  the  relative  strengths  of  the  two  types  of  predation 
are  varied.  To  study  the  variation  of  insect-tadpole  predation  strength  (top),  d  and  e  are 
varied  proportionally  (e  =  2d),  while  frog-insect  predation  is  kept  constant  (/  =  2,  g  =  2.5). 
To  study  the  variation  of  frog-insect  predation  (bottom),  /  and  g  are  varied  proportionally 
{f  =  \g),  while  insect-tadpole  predation  is  kept  constant  (d  =  2,  e  =  4).  Light  gray  depicts 
unstable  limit  cycles,  while  dark  gray  depicts  stable  limit  cycles. 
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The  first  order  coefficients  in  the  expansions  are 
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Unlike  the  system  without  carrying  capacities,  this  system  has  stable  fixed  points 
without  predation,  so  we  may  compare  populations  with  and  without  predation.  There 
are  parameter  regimes  in  which  only  amphibians,  or  only  insects,  benefit  from  the 
predation.  There  is  also  a  regime  in  which  predation  hurts  both  species,  for  instance 
when  both  species  are  bad  at  metabolizing  each  other.  This  situation  is  essentially 
a  prisoner’s  dilemma;  both  species  would  benefit  from  a  truce,  but  once  a  system  of 
reciprocal  predation  has  evolved,  it  is  hard  to  see  how  it  could  stop.  Finally,  there  is  a 
parameter  regime  where  the  predation  benefits  both  species  by  improving  the  efficiency 
with  which  the  entire  tadpole-frog-insect  system  uses  environmental  resources.  We  will 
call  this  a  regime  of  population  increase  by  mutual  predation,  or  PIMP,  and  we  now 
look  at  this  regime  in  further  detail. 


4.1.2  Population  increase  by  mutual  predation 

Suppose  the  two  types  of  predation  occur  with  relative  strengths  given  by  K  =  A .  We 
consider  turning  on  predation  with  fixed  K,  so  the  signs  of  the  total  derivatives  of  T, 
F ,  and  I  with  respect  to  k  (or  A)  dictate  whether  predation  increases  or  decreases  the 
population  biomasses  at  equilibrium.  When  e  is  small, 
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and  likewise  for  the  other  populations.  Evaluated  at  k  =  0,  just  as  predation  is  turned 
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The  signs  of  the  above  derivatives  determine  whether  predation  hurts  or  helps  a  species, 
insofar  as  its  equilibrium  biomass  decreases  or  increases.  The  regime  of  PIMP  could  be 
defined  by  the  condition  that  both  species’  biomasses  increase  with  predation,  or  by  the 
stronger  condition  that  all  three  populations’  biomasses  increase,  and  it  is  not  obvious 
which  definition  is  more  useful.  Biologically,  the  advantage  of  a  larger  population  is 
that  it  is  more  fit  because  it  is  more  genetically  diverse  and  robust  to  catastrophe.  In 
that  sense,  it  does  not  matter  what  life  stage  the  amphibians  are  in  because  one  life 


225 


stage  begets  another,  and  our  two-stage  model  is  a  simplification  anyway.  On  the  other 
hand,  it  could  still  be  catastrophic  for  an  entire  generation  of  tadpoles  to  be  wiped  out. 
We  will  thus  derive  the  conditions  for  PIMP  using  both  the  strong  and  weak  definitions. 


4.1.3  Conditions  for  strong  population  increase  by  mutual  predation 

All  three  derivatives  with  respect  to  n  are  positive,  meaning  the  equilibrium  biomasses 
of  insects  and  both  amphibian  life  stages  increase,  if  and  only  if 
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By  the  assumption  that  ^0  >1,  —  <  —  l),  so  strong  PIMP  occurs  if  and  only  if 
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The  ratio  of  predation  rates,  K ,  and  the  rate  of  reproduction,  (3,  can  be  changed  by  be¬ 
havior,  so  they  can  vary  on  much  shorter  time  scales  than  the  other  parameters,  which 
are  controlled  by  physiology.  Thus,  we  may  regard  K  and  (3  as  control  parameters.  Re¬ 
calling  that  (  =  0  —  6t,  it  is  clear  from  Equations  (13)  and  (14)  that  without  predation, 
changing  the  birth  rate  cannot  increase  frog  and  tadpole  populations  simultaneously; 
only  predation  can  do  that.  There  exists  an  interval  of  K  in  which  strong  PIMP  occurs 
precisely  when 

^(2^_1) <mF-  (17) 

For  this  inequality  to  hold,  it  is  necessary  that  metabolic  losses  not  be  too  large,  and 
also  that  a  <  7  and  0  >  That  is,  predation  can  only  increase  all  three  equilibrium 
biomasses  when  frogs  gain  biomass  through  their  interaction  with  the  external  environ¬ 
ment,  while  tadpoles  lose  it.  The  biological  interpretation  of  this  parameter  regime  is 
that  frogs  are  more  fit  for  their  environment  than  tadpoles.  Indeed,  one  could  expect 
this  to  be  true;  there  are  morphogenetic  tradeoffs  between  juvenile  and  adult  fitness, 
and  since  frogs  spend  the  majority  of  their  lives  in  the  adult  stage,  it  is  likely  that  they 
would  evolve  to  be  maximally  fit  as  adults.  It  is  less  clear  that  tadpoles  would  be  so 
unfit  as  to  lose  biomass  without  a  constant  input  from  reproduction,  but  this  is  certainly 
feasible  in  harsh  environments  that  create  high  juvenile  mortality.  So,  in  such  a  regime, 
predation  can  increase  all  three  biomasses  by  increasing  the  ratio  of  frog  biomass  to 
tadpole  biomass,  with  insects  profiting  as  middlemen. 


4.1.4  Conditions  for  weak  population  increase  by  mutual  predation 

Let  A  =  T  +  F,  the  total  amphibian  biomass.  Without  predation,  the  equilibrium 
amphibian  biomass  is 


A0  = 


a0 

l{0  ~  £f) 


N. 


There  is  an  optimal  birth  rate,  0 ,  that  maximizes  Aq  .  In  the  strong  definition  of  PIMP 
there  is  no  such  way  to  define  an  optimal  0  since  there  is  always  a  tradeoff  between  T 
and  F.  When  /3  is  above  its  optimal  value,  too  much  of  the  frogs’  biomass  is  going  into 
tadpoles,  who  are  less  fit  than  the  frogs.  When  (3  is  below  its  optimal  value,  the  frog 
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population  becomes  so  large  than  environmental  pressures  make  frogs  less  successful 
than  tadpoles. 

The  dependence  of  equilibrium  amphibian  biomass  on  weak  predation  is 
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Weak  predation  increases  the  equilibrium  values  of  both  A  and  I,  thus  satisfying  the 
definition  of  weak  PIMP,  if  and  only  if 
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and  clearly  such  a  K  exists  if  and  only  if 
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This  condition  is  harder  to  interpret  biologically  than  the  stronger  condition  of  Equa¬ 
tion  (eq:  strong  PIMP  necessary) ,  but  it  has  roughly  the  same  necessary  conditions;  in 
the  parameter  regime  for  which  predation  improves  efficiency,  the  metabolisms  cannot 
be  too  inefficient,  frogs  cannot  be  too  unfit,  and  tadpoles  cannot  be  too  fit. 

Adding  predation  is  like  decreasing  the  rate  of  reproduction  in  that  it  transfers 
biomass  from  tadpoles  to  frogs,  though  it  is  certainly  not  identical.  We  have  not  fully 
explored  the  relationship  between  optimal  /3  and  optimal  A',  but  this  would  be  a  good 
topic  for  future  work.  For  instance,  we  would  like  to  know  whether  Equation  (18) 
can  hold  when  the  reproduction  rate  is  at  its  no-predation  optimal  value,  and  how  the 
optimal  reproduction  rate  changes  in  the  presence  of  predation. 


4.1.5  Beyond  weak  predation 


We  have  seen  analytically  that  there  is  a  parameter  regime  in  which  PIMP  occurs  when 
predation  is  weak,  but  we  would  like  to  know  whether  increasing  the  strength  of  the 
predation  will  increase  the  equilibrium  populations  indefinitely.  We  have  not  tackled  this 
question  analytically,  but  in  all  numerical  experiments  the  populations  reach  a  maximum 
value  before  decreasing  as  predation  is  strengthened  further.  To  use  MATCONT,  we 
prefer  to  work  with  dimensionless  parameters,  so  we  apply  the  nondimensionalization 
of  Equation  (4),  the  same  one  used  for  the  system  without  carrying  capacities. 
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where  M  and  N  have  been  nondimensionalized  in  the  same  way  as  I  and  F,  respectively. 

In  terms  of  dimensionless  quantities,  the  regime  of  strong  PIMP  given  by  Equa¬ 
tion  (16)  becomes 
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An  example  case  that  falls  in  this  regime  has  (a,  b,  c )  = 
predation  parameters  in  fixed  proportion  such  that  e  = 
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displays  the  variation  of  equilibrium  populations  with  predation  strength.  Starting 
with  no  predation  and  increasing  the  predation  parameters  in  fixed  proportion,  we 
initially  see  increases  in  all  three  equilibrium  populations,  as  predicted  by  the  asymptotic 
analysis,  followed  by  decreases  in  all  three  populations.  Insect  population  appears  to 
go  to  zero  as  predation  goes  to  infinity,  while  frog  and  tadpole  populations  appear  to 
decrease  asymptotically  to  the  same  finite  value.  Although  it  might  be  possible  for  the 
amphibians  to  predate  the  insects  to  extinction,  it  is  not  in  their  interest  to  do  so;  this 
strategy  does  not  maximize  their  equilibrium  population. 


Figure  6:  Equilibrium  populations  as  predation  strength  is  varied  in  the  strong  PIMP 
regime.  Fixed  parameters  parameters  are  (a,  b,  c )  =  (2, 1, 1)  and  (M,  N)  =  (1, 1).  Predation 
parameters  are  varied  in  constant  proportion  with  e  =  2d,  f  =  d,  and  g  =  4 d. 


4.2  Numerical  exploration  of  parameter  space 

We  have  searched  for  bifurcations  in  parameter  space  using  MATCONT,  but  we  have  not 
found  any  in  regimes  where  PIMP  occurs;  we  simply  observe  the  equilibrium  changing 
its  coordinates  while  remaining  stable.  In  other  parameter  regimes  we  find  supercritical 
Hopf  bifurcations  but  no  saddle-node  bifurcations.  The  Hopf  bifurcations  observed  in 
this  system  were  of  a  different  character  than  those  in  the  system  without  carrying 
capacities.  In  that  system,  continuing  in  parameter  space  past  a  Hopf  bifurcation  typi¬ 
cally  led  to  growing  oscillations  and,  ultimately,  extinction.  In  the  system  with  carrying 
capacities,  a  limit  cycle  may  appear  and  grow  as  we  move  through  a  Hopf  bifurcation, 
but  as  we  continue  through  parameter  space,  it  ultimately  shrinks  and  disappears  back 
into  a  stable  equilibrium.  So,  it  seems  limit  cycles  occur  only  in  isolated  regions  of 
parameter  space  where  certain  resonances  are  strong.  This  significant  qualitative  dif- 
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ference  from  the  system  without  carrying  capacities  is  due  to  the  damping  effect  of  the 
carrying  capacities. 


5  Conclusions 

In  the  model  without  carrying  capacities,  we  examined  the  regime  in  which  the  insect 
and  frog-tadpole  systems  decay  without  predation.  Predation  can  create  stable  fixed 
points  and  limit  cycles,  though  the  limit  cycles  are  not  robust  and  rather  unrealistic 
biologically,  so  in  this  model  the  two  species  are  completely  codependent.  For  this 
stabilization  to  occur,  predation  must  be  above  a  certain  minimum  strength,  and  the 
relative  strengths  of  insect-tadpole  and  frog-insect  predation  must  be  within  a  certain 
interval.  There  are  biological  parameters  for  which  a  fixed  point  and  a  limit  cycle  are 
simultaneously  stable,  but  we  have  not  found  parameters  for  which  nontrivial  stable 
fixed  points  coexist. 

In  the  model  with  carrying  capacities,  both  species  exist  stably  without  predation. 
Depending  on  the  biological  parameters,  predation  may  increase  or  decrease  the  equi¬ 
librium  populations  in  any  combination.  Fixing  all  biological  parameters  other  than 
predation  rates,  we  have  seen  that  mutual  predation  can  increase  all  of  the  equilibrium 
populations  simultaneously.  However,  there  always  exists  an  optimal  rate  of  reproduc¬ 
tion  that  maximizes  the  equilibrium  biomass  of  amphibians,  and  we  have  not  determined 
whether  predation  can  only  increase  all  populations  simultaneously  by  effectively  de¬ 
creasing  the  rate  of  reproduction  toward  its  optimal  value.  If  this  is  the  case,  we  must 
ask  whether  it  is  biologically  realistic  for  the  amphibians  to  habitually  reproduce  at  a 
non-optimal  rate,  and  if  not,  we  would  conclude  that  population  increase  by  mutual 
predation  is  not  biologically  relevant  in  the  long  term.  This  is  both  a  mathematical  and 
biological  question  for  future  study. 
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7  Appendix:  The  Holling  type  II  functional  form 

The  simplest  system  in  which  the  Holling  type  II  functional  form  could  appear  is  a  2D 
predator-prey  model.  However,  the  justification  for  using  this  specific  form  in  such  a 
model  is  not  clear  a  priori.  A  quadratic  predation  term  that  is  proportional  to  both 
predator  and  prey  populations  would  not  require  much  justification  since  it  represents 
leading  order  behavior  at  the  very  least,  but  such  a  term  produces  unrealistic  exponential 
growth  in  the  system.  However,  building  on  the  work  of  Whitehead  and  Doering  [4],  we 
can  create  a  dichotomy  between  hungry  predators  sated  predators  and  use  only  linear 
and  quadratic  terms  in  the  resulting  3D  system,  and  then  find  that  in  the  appropriate 
limit  it  reduces  to  a  2D  predator-prey  system  with  Holling  type  II  predation  laws.  The 
functional  forms  of  the  3D  model  do  not  require  justification  since  they  are  the  simplest 
laws  that  could  capture  the  necessary  behavior.  To  evaluate  the  accuracy  of  the  reduced 
system  with  Holling  type  II  laws,  we  shall  compare  its  features  to  those  of  the  full  3D 
system. 
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7.1  The  3D  system 

The  3D  system  consists  of  hungry  predator  ( H ),  sated  predators  (S),  and  prey  (P): 

H  =  —(f>PH  +  gS  -  SH  +  (3S 
S  =  (j)PH  -gS-dS 
P=-0PJf  +  7P(  l-£). 

Hungry  predators  eat  prey  at  rate  (f>  and  become  sated  predators,  while  sated  predators 
metabolize  and  become  hungry  at  rate  fi.  Both  hungry  and  sated  predators  die  at  rate 
6,  and  sated  predators  give  birth  to  hungry  ones  at  rate  (3  >  5.  The  prey  is  being 
born  according  to  a  logistic  law.  In  a  more  accurate  model,  the  (j>  and  5  constants  in 
the  different  equations  could  take  different  values,  but  this  would  only  add  parameters 
without  changing  any  qualitative  results. 

To  reduce  the  order  of  the  system,  we  will  consider  the  limit  of  large  metabolism 
rate,  i.e.  /i  ^  (3,6 ,7.  To  see  how  each  variable  should  scale  with  /1,  we  consider  the 
three-species  equilibrium, 


(H0,So,Po)=(141£s,‘¥1£s). 

At  equilibrium,  there  is  O(g)  more  prey  than  predators.  This  motivates  us  to  scale  the 
prey  variable  in  /i,  so  we  nondimensionalize  the  system  by 

H  1— >  | h  S  1— >  6-s  Pm  ^ X  t  hm  jt. 

The  nondimensional  3D  system  is 
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We  shall  consider  the  singular  limit  where  a  predator  eats  many  times  in  its  life,  i.e. 
f<Cl.  The  parameter  b  is  the  ratio  of  predator  birthrate  to  death  rate,  g  is  the  ratio 
of  prey  birthrate  to  predator  death  rate,  and  N  is  the  dimensionless  carrying  capacity 
of  the  environment  for  prey.  Note  that  predator  and  prey  populations  are  in  different 
units,  so  their  numerical  values  can  not  be  meaningfully  compared. 

7.2  Reduced  systems 

The  3D  system’s  behavior  can  only  be  well  approximated  by  a  two  dimensional  system 
if  its  behavior  is  roughly  two  dimensional,  for  instance  this  implies  that  it  must  not  be 
chaotic.  There  must  be  a  two-dimensional  slow  manifold  in  phase  space  on  which  all 
solutions  approximately  lie,  possibly  after  some  transient  behavior  as  the  component 
of  the  solution  on  the  fast  manifold  rapidly  decays.  Reduction  of  order  is  achieved  by 


=  —-Xh  +-s  —  h+bs 
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=  - Xh s  —  s 

e  e 

=  -Xh  +  gX(  l-£), 
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projecting  the  full  system  onto  the  slow  manifold.  To  find  the  slow  manifold,  we  observe 
that  the  system  is  linear  in  h  and  s  if  X  is  regarded  as  a  known  function  of  time: 


The  eigenvalues  of  this  system  are 


A± 


11+1 

e  2 


—  1±1X+1 
e  2 


4  bX 

W+W 


To  first  order  in  e, 

^  =  -1  +  3TTT  +  0<') 


The  A+  eigenvalue  is  0(1),  while  the  A_  eigenvalue  is  negative  and  0(1),  so  solutions 
decay  quickly  along  the  direction  of  the  A_  eigenvector  and  move  more  slowly  in  the 
A+  direction.  Thus  at  a  given  X ,  the  A_  vector  is  tangent  to  the  fast  manifold,  and  the 
A+  vector  is  tangent  to  the  slow  manifold. 


7.2.1  Full  2D  system 

The  A+  eigenvector  yields  a  proportionality  between  s  and  h  on  the  slow  manifold  as  a 
function  of  X, 

S=  2(1  + eft)  (^-1+\/(^  +  l)2+4 ebx)h. 

This  relation  may  be  used  to  reduce  the  dynamical  system  by  eliminating  either  h  or 
s,  but  since  we  are  ultimately  concerned  with  the  total  number  of  predators,  we  define 
Y  =  h  +  s  and  work  in  this  variable.  Applying  the  above  s(h)  relation  to  the  3D  system, 
we  obtain  our  2D  reduction, 


X  =  gX{  1 
Y  =  -Y  + 


_ 2(1  +  eb)XY _ 

N  X  +  1  +  y/(X  +  l)2  +  4  ebX  +  2  eb 
(X  -  1  +  y/(X  +  l)2  +  4ebX)bY 
X  +  1  +  y/(X  +  1)2  +  AebX  +  2eb' 


7.2.2  O(e)  2D  system 


If  we  approximate  the  s(h)  relation  to  O(e),  we  obtain  a  simpler  relation  that  still 
captures  some  affects  of  finite  e, 

s  =  (X  -  ej^x)h  +  0(e2). 

The  O(e)  truncated  reduced  system  is 


X  =  gX(l-  §)- 
bXY 


XY  __  bX3Y 
N>  1  +  X  ~e(X  +  l)3 


Y  =  —Y 


b2X2Y 
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7.2.3  Holling  type  II  2D  system 

If  we  truncate  further  and  retain  only  the  0(1)  term  in  e, 

s  =  hX  +  O(e). 


Using  this  simple  proportionality  we  recover  the  Holling  type  II  functional  form. 


1  =  gX(  1  -  § )  - 


Y  =  -Y  + 


bXY 
1  +  X' 


XY 

1  +  X 


The  form  goes  by  various  other  names  in  other  fields,  such  as  the  Jacob-Monod 
form  in  microbiology  or  the  Michaelis-Menten  form  in  enzyme  kinetics. 


7.3  Comparison  of  system  behaviors 

To  understand  what  has  been  lost  by  projecting  onto  the  slow  manifold,  as  well  as 
by  truncating  in  e,  one  must  compare  the  behavior  of  the  full  3D  system  with  the 
behaviors  of  the  reduced  2D  systems.  There  is  no  unique  measure  of  the  quality  of  the 
approximation;  different  properties  are  approximated  better  than  others,  so  the  value 
of  the  approximation  depends  ultimately  on  what  properties  are  of  interest. 

For  the  3D  system,  we  ignore  the  separate  dynamics  of  the  h  and  s  variables  and 
consider  only  their  sum,  Y,  because  this  is  the  quantity  of  interest  and  the  one  that 
compares  directly  to  the  2D  models.  As  we  demonstrate  below,  all  four  models  have 
the  same  qualitative  behavior,  some  representative  phase  portraits  of  which  are  given  in 
Figure  7.  There  is  a  trivial  equilibrium  at  the  origin  representing  mass  extinction,  but 
it  is  always  unstable.  There  is  a  prey-only  equilibrium,  stable  only  when  b  and  N  are 
small.  That  is,  when  predators  are  not  born  too  fast,  and  saturation  population  of  prey 
is  not  too  large,  both  factors  that  would  inhibit  predator  success.  When  b  and  N  are  a 
bit  larger  so  conditions  are  a  bit  better  for  predators,  the  prey-only  equilibrium  becomes 
unstable  as  a  stable  two-species  equilibirum  becomes  physical  and  splits  off  from  it  in 
a  transcritical  bifurcation.  The  two-species  equilibrium  is  a  plain  sink  initially  and 
becomes  a  spiral  sink  at  larger  b  and  N.  When  b  and  N  are  increased  further  still, 
the  two-species  equilibrium  undergoes  a  Hopf  bifurcation,  losing  its  stability  to  a  limit 
cycle.  All  orbits  are  bounded  for  all  parameter  values.  Since  all  four  systems  share 
this  qualitative  picture,  the  effects  of  approximation  appear  only  in  the  quantitative 
differences  between,  say,  equilibria  and  limit  cycle  locations,  or  bifurcation  values. 


7.3.1  Equilibria 

At  all  parameter  values,  all  four  systems  have  equilibria  at  the  origin  and  at  (X,  Y)  = 
( N ,  0).  Solving  for  the  nontrivial  two-species  equilibria,  we  obtain  Yq  =  r^j(l  —  for 
all  four  systems,  though  the  Xo  value  may  vary  between  systems.  Clearly  the  nontrivial 
equilibrium  is  only  physical  (Ij)  >  0)  when  X0  is  less  than  N,  the  prey-only  saturation 
population.  That  is,  such  equilibria  never  represent  mutualistic  solutions.  The  Xo 
values  for  the  two-species  equilibria  in  each  system  are  tabulated  in  Table  2.  The  Xo 
value  for  the  0(e)  system  is  more  cleanly  expressed  implicitly  by  the  cubic  equation 
of  which  it  is  the  only  positive  real  root.  The  full  2D  system  has  exactly  the  same 
equilibrium  as  the  3D  system,  while  the  truncated  systems  have  different  two-species 
equilibria,  which  converge  to  the  3D  value  as  e  — >  0. 
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Figure  7:  Sample  phase  portraits  representing  all  four  possible  qualitative  behaviors,  con¬ 
sidering  N  as  the  control  parameter:  stable  prey-only  equilibrium  (upper  left),  stable  two- 
species  equilibrium  with  plain  sink  (upper  right),  stable  two-species  equilibrium  with  spiral 
sink  (lower  left),  and  a  limit  cycle  around  the  two-species  equilibrium  (lower  right).  These 
portraits  were  generated  by  integrating  the  3D  system  with  b  =  2,  g  =  1,  and  e  =  0.5. 
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Table  2:  Locations  of  the  two-species  equilibria  in  the  four  models. 


Model 
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7.3.2  Linear  stability 

To  analyze  the  linear  stability  of  the  three  equilibria  in  all  four  systems,  we  linearize 
each  system  about  an  arbitrary  point,  (ho,  Sq,  X0),  or  (X0,lo). 
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About  the  origin,  every  linearized  system  has  an  eigenvalue  of  g ,  which  is  positive, 
so  the  extinction  equilibrium  is  always  unstable.  Linearizing  about  the  prey-only  equi¬ 
librium,  one  finds  that  each  system  goes  unstable  according  to  the  same  parameter 
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inequalities  that  determine  when  the  two-species  equilibrium  exists.  This  is  as  expected 
from  a  transcritical  bifurcation;  the  stability  of  the  prey-only  equilibrium  changes  pre¬ 
cisely  when  it  collides  with  the  two-species  equilibrium,  which  is  also  the  moment  when 
that  equilibrium  becomes  physical.  As  for  the  two-species  equilibrium,  we  can  study  its 
stability  analytically  in  the  Holling  type  II  system,  but  the  other  linearized  systems  are 
messy,  so  we  solve  their  stability  eigenproblems  numerically. 


7.3.3  Stability  of  the  two-species  equilibrium  in  the  Holling  type  II 
2D  system 


The  Holling  type  II  system  linearized  about  its  two-species  equilibrium  is 


d_ 

dt 


(“) 


'  3  (\ _ b+1 _ 3 

H  N(b-l)J 


The  stability  of  the  two-species  equilibria  is  not  hard  to  compute  analytically  for  the 
Holling  type  II  system.  The  characteristic  equation  of  the  linearized  system  is 

A2  “  b  (1  ~  Nb(b-1)  )  A+f(5_1_F)=°- 

For  the  two-species  equilibrium  to  exist,  the  0(1)  coefficient  of  the  characterisitc  equa¬ 
tion  must  be  positive.  Thus,  solving  the  quadratic  equation  for  A,  the  discriminant  will 
either  be  imaginary  or  of  smaller  magnitude  than  the  O(A)  coefficient.  Either  way,  both 
eigenvalues  will  be  negative  (the  equilibrium  will  be  stable)  if  and  only  if  the  0(A)  co¬ 
efficient  is  positive,  i.e.  when  N  <  When  N  exceeds  this  value,  all  three  equilibria 
are  unstable.  We  later  prove  that  all  orbits  are  bounded,  so  the  Poincare-Bendixon 
theorem  will  guarantee  that  the  system  converges  to  a  limit  cycle. 


7.3.4  Bifurcations 

The  transcritical  bifurcation  occurs  when  Yq  exceeds  zero,  which  occurs  when  the  Xo 
expressions  reported  in  Table  2  are  less  than  N.  For  each  system,  this  is  possible  only 
when  b  >  1.  The  exact  relation  between  b  and  N  at  the  bifurcations  are  given  in  Table 
3.  Note  that  from  a  point  in  parameter  space  where  the  prey-only  equilibrium  is  stable, 
the  bifurcation  may  be  produced  by  increasing  either  b  or  N.  The  point  in  parameter 
space  where  the  Hopf  bifurcation  occurs  has  a  simple  analytic  expression  for  the  Holling 
type  II  model,  so  this  is  also  given  in  Table  3.  For  the  other  models,  the  N  at  which  the 
Hopf  bifurcation  occurs  was  computed  numerically  for  given  values  of  6,  g  and  e,  and 
some  representative  results  are  plotted  in  Figure  8.  The  3D  transcritical  bifurcation 
depends  only  on  6,  N  and  e,  while  the  Hopf  bifurcation  depends  also  on  g,  but  quite 
weakly  so.  It  is  clear  from  Figure  8  that  the  full  2D  model  captures  the  transcritical 
bifurcation  perfectly,  while  the  truncated  models  are  inaccurate  when  e  becomes  large. 
At  the  Hopf  bifurcation,  the  full  2D  model  captures  the  3D  behavior  imperfectly,  but 
again  much  better  than  the  truncated  models. 


7.3.5  Lyapunov  stability 

Each  system  undergoes  only  the  two  bifurcations  we  have  studied  and  has  no  other 
fixed  points.  All  that  remains  is  to  verify  that  in  each  system  the  orbits  are  bounded 
for  all  parameter  values.  We  shall  do  this  by  the  Lyapunov  method  for  the  3D  system 
and  the  Holling  type  II  2D  system.  We  shall  not  prove  boundedness  for  the  other  two 
systems,  whose  algebraic  nonlinearities  would  make  it  a  cumbersome  task,  but  we  can 
feel  confident  in  its  veracity. 
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Table  3:  Bifurcation  points  of  the  four  models.  For  the  0(e)  model,  Xq  is  defined  implicitly 
by  the  formula  given  in  Table  2. 


Model 

Transcritical  bifurcation 

Hopf  bifurcation 

3D 

0  <  jgf  <  N 

Found  numerically 

Full  2D 

0  <  <N 

O(e)  2D 

0  <  Xo  <  N 

Holling  type  II  2D 

«<X<N 

gfXv 

Holling  type  II  2D  system  Examining  the  X  equation,  we  see  that  X  <  0 
whenever  X  >  N.  Thus,  if  X  <  IV  at  the  initial  condition,  it  remains  true  for  all  time. 
To  put  an  upper  bound  on  Y  that  is  valid  for  all  parameters,  we  must  consider  X  and 
Y  together  in  a  Lyapunov  functional. 

Let  L  =  X  +  jY .  The  proportionality  between  X  and  Y  is  chosen  such  that  the 
nonlinear  Holling  type  II  terms  in  L  cancel: 

L  =  gX(l-§)-lY 


Our  goal  is  to  bound  L  by  an  affine  function  of  L,  i.e.  L  <  a  —  (3L,  where  (3  >  0.  This 
will  imply  that  L  <  %  for  all  time  if  it  is  true  initially.  To  bound  L  by  such  a  term  we 
must  bound  the  quadratic  X  term  by  an  affine  function  of  X  with  a  negative  coefficient 
on  X.  Any  line  tangent  to  the  parabola  at  X*  >  ^  will  suffice,  but  we  seek  the  line 
that  minimizes  thereby  providing  the  optimal  bound  on  L.  An  arbitrary  tangent 
line  gives  the  bound 
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Assuming  the  above  infonum  occurs  at  an  X*  such  that  —  1  <  \ ,  the  optimal  choice 
of  X*  in  fact  contradicts  the  assumption  when  b  >  1.  So,  the  minimum  must  be 
meaning  that  X*  >  X  (f  _|_  I)  >  jy.  The  bound  on  L  then  becomes 
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x,>f(i+L 
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Putting  this  in  terms  of  X  and  Y,  and  adding  the  known  bound  on  X  alone, 

X  <  min  j  jpdl  +  b)2  —  \Y,N |. 


3D  system  Let  L  =  h  +  as  +  f3X.  To  make  the  hX  terms  to  vanish  from  L,  we  let 
/3  =  which  clearly  requires  a  >  1  for  (3  to  be  positive.  Thus, 


L  =  —  [T  +  1  —  (  \  +  b) / a\as  —  h+  -(a  —  l)gX(l  -  y) 
<  -m(a)L+  i(a-  l)X[(g  +  m{aj)  -  ^] , 
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Figure  8:  Values  of  N  as  a  function  of  e  at  the  transcritical  and  Hopf  bifurcations  for  all 
four  systems  with  6  =  2  and  g  =  1.  For  the  transcritical  bifurcation,  the  full  2D  systems 
coincides  with  the  3D  system. 


where  m(a)  =  min  {1,^  +  1  —  (-  +  6) /a}.  We  now  bound  the  quadratic  X  term  by  its 
maximum  value  without  bothering  to  find  the  optimal  bound  linear  in  X. 

L  <  -m{a)L+  1  )(g  +  m{a))2. 

This  yields  a  bound  on  L  that  can  be  optimized  over  all  allowable  a: 

L<  inf  ~r~  a7\  (g  +  m(a))~ . 
a>l  4£9  m(“)  v  '> 


Assuming  the  infimum  occurs  when  m(a)  =  1  implies  that  the  optimal  bound  is  obtained 
by  choosing  a  =  1+,  which  contradicts  m(a )  =  1.  Let  a  >  giving  our  best  result 

for  a  bound  on  the  Lyapunov  functional: 
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The  optimal  bound  can  be  calculated  given  the  other  parameters,  but  the  more  impor¬ 
tant  conclusion  is  that  some  such  finite  bound  always  exists  for  L. 


7.3.6  Summary  of  results 

Our  analysis  strongly  suggests  that  X  and  Y  have  the  same  qualitative  behavior  in  all 
four  systems,  though  to  make  this  result  rigorous,  one  needs  Lyapunov  bounds  on  the 
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full  2D  and  O(e)  2D  systems,  and  one  needs  to  show  analytically  that  a  Hopf  bifurcation 
occurs  in  all  systems  as  it  does  in  the  Holling  type  II  system.  Trusting  that  the  systems 
indeed  all  have  the  same  behavior,  they  differ  only  quantitatively.  We  have  seen  that 
the  full  2D  system  has  the  same  two-species  equilibrium  as  the  3D  system,  while  the 
truncated  systems  do  not,  and  quantitative  differences  in  bifurcation  values  were  shown 
already  in  Figure  8. 

Phase  portraits  produced  by  the  different  models  appear  in  Figure  9.  Although  the 
2D  systems  began  with  the  same  initial  conditions,  we  must  compare  them  each  to 
their  own  corresponding  3D  solution  because  they  each  correspond  to  slightly  different 
decompositions  of  Y0  into  ho  and  So-  However,  the  three  3D  solutions  tend  to  be  quite 
similar.  The  top  row  of  Figure  9  shows  solutions  at  small  N,  when  the  prey-only 
equilibrium  is  stable.  The  prey-only  equilibrium  is  identical  in  all  four  systems,  so  the 
phase  portraits  agree  well  even  for  e  =  0.5.  The  middle  row  of  Figure  9  shows  solutions 
for  larger  N,  when  the  two-species  equilibrium  is  stable.  The  different  systems  agree 
well  when  e  is  0.05,  but  at  0.5  the  locations  of  the  equilibria  differ  significantly,  so  the 
respective  phase  trajectories  spiraling  towards  them  are  quantitatively  quite  different. 
The  bottom  row  of  Figure  9  shows  long-time  solutions  at  still  larger  N.  When  e  is  0.05, 
the  limit  cycles  of  the  full  2D  and  0(e)  2D  systems  approximate  the  3D  limit  cycle 
quite  well,  while  the  Holling  type  II  system  does  a  bit  worse.  When  e  is  0.5,  the  Holling 
type  II  system’s  limit  cycle  is  much  too  large,  the  full  2D  system’s  is  too  small  but  a 
bit  better,  and  the  O(e)  2D  system  has  not  yet  gone  through  the  Hopf  bifurcation. 

The  full  2D  system  approximates  the  3D  system  well  for  e  <  0.5,  while  the  Holling 
type  II  2D  system  is  quantitatively  accurate  only  when  e  is  an  order  of  magnitude 
smaller.  We  wish  to  extrapolate  these  truths  to  other  models  where  the  Holling  type 
II  or  full-order-in-e  functional  forms  might  be  used  as  predation  laws  without  repeating 
their  rigorous  derivation  from  a  higher-order  dynamical  system.  If  e  is  very  small, 
or  if  one  is  only  concerned  with  qualitative  features,  as  is  often  the  case  in  biological 
modeling,  the  Holling  type  II  functional  form  is  certainly  satisfactory.  If  e  is  closer 
to  unity,  and  the  quantitative  properties  of  the  system  matter,  as  is  often  the  case  in 
enzyme  kinetics,  the  full-order-in-e  functional  form  would  be  a  better  choice.  The  O(e) 
functional  form  probably  offers  neither  enough  simplicity  nor  accuracy  to  be  chosen 
over  the  other  two. 
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e  =  0.05,  N  =  5 


X  (prey) 


Figure  9:  Phase  portraits  of  all  four  systems  for  values  of  N  with  e  =  0.05  (left)  and  e  =  0.5 
(right),  and  b  =  2,  g  =  1.  All  solutions  began  at  (2,2),  but  only  the  late-time  behavior 
is  shown  in  the  bottom  two  plots  to  make  the  limit  cycles  clear.  The  asterisks  are  the 
equilibria  of  the  different  systems,  which  always  coincide  for  the  3D  system  and  the  full  2D 
system. 
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1  Introduction 

Vast  verity  of  geophysical  flows  occur  in  density  stratified  fluids  in  a  presence  of  sheared 
mean  velocity  profiles.  Under  such  conditions  it’s  been  observed  that  vigorous  turbulence 
leads  to  generation  of  mixed  layers  within  the  fluid,  e.g.  the  regions  of  homogeneous  density 
separated  from  each  other  by  strong  density  gradients  (interfaces).  These  layers  could  be  of 
large  vertical  scales  as  well  as  of  fine  scales.  An  example  of  such  flows  is  the  destratification 
of  the  upper  layer  of  the  ocean  under  the  action  of  the  surface  wind  stress  and  the  formation 
of  multiple  mixed  layers  which  are  observed  within  the  seasonal  thermocline  in  the  ocean 
as  well  as  in  fresh  lakes  [10].  Such  structures  in  a  turbulent  flows  are  thought  to  play 
a  crucial  role  in  the  determining  the  turbulent  fluxes  of  momentum,  buoyancy  and  other 
tracers  which  in  turn  affect  the  further  dynamics  of  the  flow.  Nonetheless,  the  problem  of 
turbulent  transport  remains  one  the  most  difficult  ones  in  geophysics. 

Previous  studies  investigated  mechanisms  acting  in  stratified  turbulent  flows  -  a  review 
of  those  could  be  found  in  [4],  The  formation  of  small  scale  mixed  layers  in  stratified  fluids 
driven  by  external  mechanical  sources  of  turbulence  were  investigated  by  theoretically  by 
Balmforth  et  al  [1],  and  experimentally  by  Park  et  al  [6].  In  the  present  study  the 
dynamics  of  layers  is  explored  within  a  turbulent  fluid  in  a  presence  of  a  sheared  mean 
flow.  A  set  of  experiments  were  performed  in  which  a  linearly  stratified  fluid  is  driven 
by  the  stress  applied  by  a  rotating  horizontal  disk  at  the  surface.  This  setting  leads  to 
the  formation  of  the  surface  mixed  layer  extending  from  the  the  disk  into  the  interior  of 
the  fluid.  Its  evolution  with  time  and  its  dependence  of  the  stratification  and  the  rotation 
of  the  disk  are  obtained.  Furthermore,  small  scale  mixed  layers  were  observed  in  below 
the  surface  mixed  layer.  They  have  a  complicated  dynamics  with  multiple  regimes,  which 
were  attempted  to  characterize.  The  description  of  the  experiment  and  the  behavior  of 
the  observed  mixed  layers  is  presented  in  section  (2).  Further,  a  phenomenological  model  is 
developed  that  explains  the  mechanism  under  which  the  secondary  mixed  layers  are  formed. 
It  parametrized  the  action  of  the  two  mechanisms:  shear  generation  of  the  turbulence  due 
to  Kelvin-Helmholtz  type  of  instabilities,  and  the  mean  flow  generated  turbulence  though 
the  vortex  scraping  mechanism.  The  resulting  gradient-type  mixing  model  predicts  the 
formation  of  the  instabilities  in  a  particular  parameter  regime  that  lead  to  mixed  layers 
formation.  The  instabilities  are  related  to  those  described  by  Phillips  [7]  and  Posmentier 
[8].  The  model  description,  its  analysis  and  numerical  simulations  are  presented  in  section 
(3).  The  summary  of  the  work  is  in  section  4. 
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Figure  1:  Experimental  set-up:  cylindrical  tank  filled  with  stratified  fluid,  rotating  disk  at 
the  surface  of  the  fluid,  conductivity  probe 

2  Experiments 

A  series  of  experiments  were  made  where  a  stratified  fluid  was  forced  by  a  rotating  horizontal 
disk  at  the  surface  with  the  objective  to  investigate  the  dynamics  of  the  formed  mixed  layers 
under  various  cases  of  stratification  and  the  disk  rotating  speeds.  The  experimental  setting 
is  similar  to  that  used  by  Davies  et  al  [3],  who  investigated  the  flow  generated  during  the 
spin  up  process  and  further  established  secondary  circulation. 

2.1  Experiment  arrangement 

Experimental  set-up  is  shown  on  Figure  1.  A  cylindric  tank  of  30  cm  height  and  30  cm 
diameter  was  filled  with  a  salt  stratified  water.  Only  linear  stratifications  were  considered, 
which  were  obtained  with  a  standard  double  bucket  technique  [5].  The  range  of  density 
variations  within  the  fluid  was  from  2%  to  20%.  A  horizontal  disk  with  the  diameter  of 
24  cm  is  located  at  the  surface  of  the  fluid,  with  its  axis  of  rotation  aligned  with  the  axis 
of  the  cylinder.  The  rotation  speed  of  the  disk  is  controlled  by  the  motor  (attached  above 
the  disk)  that  gives  a  range  of  0.5  -  10  rad/s.  Measurements  of  vertical  density  profiles 
were  obtained  using  a  conductivity  probe  which  was  taking  100  measurements  per  1  cm  of 
fluid,  moving  vertically  with  a  speed  of  3  mm/s.  Measurements  were  taken  only  when  the 
probe  was  moving  down  to  avoid  fluxes  of  water  along  the  probe  that  would  contaminate 
the  data.  The  probe  Reynolds  number  is  of  the  order  of  100  (  this  is  an  upper  bound,  which 
is  reached  at  the  edge  of  a  disk);  thus,  measurements  are  not  considerably  affected  by  the 
presence  of  the  probe  on  a  scales  considered  in  the  current  study  -  0{cm).  The  profiled 
depth  of  the  fluid  was  20  cm,  whereas  the  height  of  it  is  27 cm.  Density  was  calculated  based 
on  conductivity  with  a  calibration  using  a  3rd  order  polynomial  as  a  mapping  function.  To 
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Figure  2:  Images  of  the  typical  interfaces  between  the  well  mixed  layer  and  a  linearly 
stratified  layer  below  it. 


reduce  temperature  effects  on  conductivity  the  fluid  allowed  to  reach  the  room  temperature 
(20°C)  before  the  start  of  the  experiment.  Thus,  the  control  parameters  that  define  the 
outcome  of  the  experiment  were  the  stratification  of  the  fluid  (expressed  as  a  Brunt -Vaisala 
frequency  N)  and  the  rotating  speed  of  the  disk  fi. 

There  are  multiple  non-dimensional  parameters  involved  in  current  experiments.  The 
most  relevant  one  is  the  Richardson  number,  which  will  be  defined  here  as  Ri  =  .  This 

definition  should  not  be  confused  with  the  local  Richardson  number  yf  o  which  varies  with 
time  and  the  location  in  the  fluid.  The  explored  range  of  Ri  G  (0.15  —  2.3).  Another 
parameter  would  be  the  disk  edge  Reynolds  number:  Re  =  ^  ~  105,  where  U  is  the 
azimuthal  velocity  at  the  edge  of  the  disk,  L  is  the  radius  of  the  disk,  and  /r  is  the  kinematic 
viscosity  of  water.  A  flow  with  such  high  values  of  Reynolds  number  could  be  considered 
turbulent  and  independent  from  the  exact  value  of  Re.  The  Schmidt  number  for  the  salt 
stratified  water  is  a  =  -  ~  700,  where  k  is  the  salt  diffusivity.  The  tank  aspect  ratio  as  well 
as  the  ration  of  the  disk  radius  to  the  height  of  the  water  column  is  ~  1.  All  geometrical 
parameters  as  well  as  the  Schmidt  number  were  fixed  throughout  all  experiments.  Thus,  the 
only  non-dimensional  parameter  that  describes  the  different  outcomes  of  the  experiments 
is  the  Richardson  number. 

2.2  Observations  of  density  profiles 

After  the  tank  is  filled  with  the  stratified  fluid  the  disk  is  set  in  motion.  As  was  mentioned 
previously,  the  disk  Reynolds  number  is  big  enough  that  the  flow  adjacent  to  it  becomes 
turbulent.  Such  flows  are  very  effective  in  diffusing  buoyancy  and  momentum.  Thus,  a 
mixed  layer  forms  below  the  disk,  with  almost  homogeneous  density  and  velocity  distribu¬ 
tions.  The  growth  of  such  a  layer  however  is  suppressed  by  the  stable  stratification  of  the 
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density  profiles 


Figure  3:  Density  profiles  plotted  for  different  times  after  the  start  of  the  experiment. 
Profiles  are  shifted  along  the  X-coordinate  by  0.03 g/cm5  from  each  other. 


fluid.  As  the  mixed  layer  deepens  the  gravitational  potential  energy  of  the  fluid  increases 
because  mixing  process  effectively  lifts  the  center  of  the  mass  of  a  water  column  by  mixing 
dense  waters  upwards  against  the  gravitational  field.  Furthermore,  mixing  in  a  stratified 
fluid  leads  to  formation  of  a  density  jump  at  the  base  of  the  mixed  layer.  The  density 
jump  suppresses  turbulence  and  effectively  separates  the  turbulent  flow  in  the  upper  mixed 
layer  from  almost  quiescent  layer  underneath  it.  Within  the  mixed  layer  there  is  a  mean 
azimuthal  flow  due  to  the  coherent  rotation  of  the  disk.  This  flow  is  also  suppresses  when  it 
reaches  strong  stratification  at  the  base  of  the  layer  creating  a  strong  shear  and  providing 
other  mechanisms  of  the  generation  of  turbulence  which  would  be  discussed  further. 

Figure  2  shows  images  of  two  interfaces.  The  top  one  appears  at  early  stages  in  the 
development  of  the  mixed  layer  and  is  characterized  by  weak  density  jump  across  it,  ener¬ 
getic  eddies  and  large  thickness  (few  cm).  The  one  shown  at  the  bottom,  forms  during  later 
stages  and  has  low  energy  eddies  of  smaller  sizes  (less  then  a  cm)  and  much  stronger  density 
jump  across  it.  The  signature  of  these  interfaces  is  clearly  present  in  the  vertical  density 
profiles,  the  time  evolution  of  which  is  shown  on  Figure  3.  Initially,  linearly  stratified  fluid 
get  mixed  from  the  top  and  forms  a  mixed  layer  as  well  as  the  density  jump  at  the  base 
of  it,  both  of  which  are  growing  in  magnitude.  The  interface  separating  the  mixed  layer 
from  almost  undisturbed  bottom  layer  changes  its  characteristics  as  well.  The  profile  taken 
at  t  =  0.2 h  (hours)  shows  the  presence  of  unstable  stratification,  which  resulted  from  the 
overturning  of  energetic  eddies  at  the  base  of  the  mixed  layer.  The  density  interface  for  this 
profile  corresponds  to  a  type  shown  on  Figure  2  (top).  At  the  profile  taken  at  t  =  6 h  it  is 
hard  to  see  the  density  structures  resulted  from  overturning  of  eddies  because  turbulence  is 
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Exp# 

N 

n 

Ri 

a 

A 

9 

0.5 

1.13 

0.2 

0.2 

0.42 

13 

0.5 

1.34 

0.14 

0.18 

0.5 

14 

0.52 

0.72 

0.52 

0.2 

0.32 

15 

1.37 

1.7 

0.65 

0.23 

0.27 

17 

1.3 

0.95 

1.87 

0.246 

0.19 

18 

1.1 

1.5 

0.54 

0.22 

0.29 

19 

1.12 

1.7 

0.43 

0.2 

0.36 

20 

1.18 

1.13 

1.09 

0.25 

0.21 

21 

1.3 

1.34 

0.95 

0.24 

0.23 

22 

0.98 

0.95 

1.05 

0.24 

0.24 

23 

0.84 

2.08 

0.16 

0.21 

0.46 

24 

0.78 

0.72 

1.18 

0.24 

0.23 

25 

1.46 

1.13 

1.67 

0.25 

0.18 

26 

1.7 

1.7 

1 

0.225 

0.22 

27 

1.7 

1.13 

2.25 

0.225 

0.24 

28 

1.72 

3.14 

0.3 

0.18 

0.41 

29 

1.6 

1.94 

0.67 

0.22 

0.29 

30 

1.42 

1.13 

1.57 

0.23 

0.22 

31 

1.5 

1.18 

1.6 

0.24 

0.22 

32 

0.94 

1.5 

0.4 

0.23 

0.29 

Table  1:  The  table  shows  the  experiment  settings:  initial  stratification  of  the  fluid  (N),  the 
disk  angular  velocity  Q.  The  last  two  columns  show  the  coefficients  of  the  power  law  fit  for 
evolution  of  the  mixed  layer  depth  h  =  Ata.  The  values  are  organized  and  sorted  by  the 
experiment  number. 


highly  suppressed  there  and  eddies  are  of  smaller  scale;  this  interface  corresponds  the  one 
on  Figure  2(bottom). 

The  presence  of  the  density  jump  and  a  velocity  shear  across  it  points  to  a  possibility 
of  generation  of  the  Kelvin-Helmholtz  type  of  instabilities  at  the  interface  that  leads  to  its 
further  erosion.  However,  these  type  of  instabilities  would  be  present  if  the  local  Richardson 
number  (IV2 /Uz2)  is  sufficiently  small.  Eventually  the  flow  would  reach  conditions  stable  to 
K-H  instabilities  as  the  velocity  of  the  mixed  layer  would  gradually  slow  down,  whilst  the 
density  jump  across  the  interface  would  increase.  At  this  point,  the  erosion  of  the  interface 
would  continue  due  to  the  mean  flow  acting  frictionally  on  turbulent  eddies  at  the  interface 
(vortex  scraping).  Thus,  whether  it’s  the  mean  flow  or  its  shear,  there  is  a  mechanism  which 
would  generate  mixing  at  the  stable  interface  and  lead  to  its  further  deepening. 

2.3  Development  of  the  primary  mixed  layer 

As  was  described  earlier,  a  mixed  layer  forms  at  the  disk  and  expands  into  the  fluid  reaching 
the  thickness  of  the  order  of  the  tank  size.  It  will  be  referred  to  as  the  primary  mixed  layer. 
In  certain  cases,  there  are  other  layers  that  form  below  it  (secondary  mixed  layers)  which 
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Figure  4:  A  non-dimensional  depth  of  the  primary  mixed  layer  is  plotted  as  a  function  of 
non-dimensional  time  (left)  and  on  a  log- log  plot  (right). 


are  of  smaller  scales  and  have  much  weaker  interfaces.  These  will  be  discussed  in  the  next 
section. 

For  many  geophysical  applications  it  is  important  to  know  the  growth  of  the  primary 
mixed  layer  with  time  as  it  controls  the  distribution  of  tracers  as  well  as  the  dynamics  of 
the  flow  .  To  present  the  observations,  the  mixed  layer  depth  h  is  non-dimensionalized  by 
the  depth  of  the  fluid  ( h  =  h/Hw),  and  time  t  by  the  period  of  rotation  of  the  disk  (t  =  Qt). 
Figure  4(left)  shows  the  time  evolution  of  the  primary  mixed  layer  for  a  typical  experiment: 
the  layer  grows  rapidly  at  first,  then  the  growth  slows  down  due  to  the  increasing  density 
jump  at  the  base  of  the  layer.  Figure  4(right)  shows  the  thickness  of  the  layer  as  a  function 
of  time  on  a  log-log  plot  with  a  corresponding  linear  fit  ( R 2  =  0.89).  This  fit  suggests  a 
power  law  h  =  Ata,  where  A  and  a  presumably  are  functions  of  the  Richardson  number  - 
the  control  parameter  of  the  experiment.  Indeed,  Figure  5  shows  that  there  is  a  particular 
dependence  of  the  fit  coefficients  on  the  Richardson  number.  The  power  a  stays  relatively 
constant  at  a  value  of  0.22  with  some  trend  towards  smaller  values  at  lower  Richardson 
numbers.  The  coefficient  A  of  the  fit  seems  to  scale  with  the  Richardson  number  as  A  ~ 
Ri~ 0.33  ag  ghown  in  Figure  5(right).  Thus,  the  experimental  data  shows  that  the  non- 
dimensional  mixed  layer  depth  scales  as  h  ~  i??'-0'33  f022. 

To  understand  the  obtained  dependence  of  the  mixed  layer  depth  evolution,  consider 
the  energy  balance  of  the  system.  The  source  for  the  generation  of  the  mean  flow  as  well 
as  the  mixing  of  the  stratified  fluid  is  a  power  produced  by  frictional  stress  due  to  motion 
of  the  disk.  It  scales  as  P  ~  tU  ~  U3,  where  r  ~  U2  is  the  frictional  stress  and  U  is  the 
characteristic  velocity  of  the  mixed  layer  near  the  disk.  Further,  a  linearly  stratified  fluid 
that  was  mixed  to  a  depth  h  would  have  its  gravitational  potential  energy  increased  by 
AGPE  =  J°h(p-  p0)gzdz  r\_/  N2h3,  where  po  is  the  initial  linear  density  profile  and  p  is  the 
one  obtained  by  adiabatic  mixing  the  initial  profile  within  the  layer  of  thickness  h.  At  last, 
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N2«l2  loci  (N2/rf) 


Figure  5:  Coefficients  of  the  power  fit  for  the  depth  of  the  mixed  layer  are  plotted  as 
functions  of  Ri.  On  the  left  is  the  coefficient  a  =  a(Ri );  coefficient  A  =  A(Ri )  plotted  on 
a  log- log  scale  (right). 


the  kinetic  energy  of  the  mixed  layer  would  scale  as  KE  ~  U2h  and  the  dissipation  D  is 
assumed  to  be  proportional  to  the  energy  production  ( D  =  7 P).  Thus,  the  energy  balance 
is  described  by  the  following  equation: 

Y(KE  +  PE )  =  P~D =7  ^-( U2h  +  N2h3)  ~  U3 

At  this  point,  an  assumption  is  made  that  the  flow  reaches  a  state  where  the  kinetic 
energy  of  the  mixed  layer  approaches  a  constant  value  and  the  energy  input  redistributes 
between  potential  energy  and  dissipation.  If  KE  =  cont  then  U  ~  /i-1/2  and  the  energy 
balance  model  becomes  h2ht  rsj  h  3/2  =7-  h  ^2/9  cphe  obtained  power  law  for  the  growth  of 
the  mixed  layer  is  consistent  with  observations  since  they  indicate  a  power  of  0.22  (Figure  5). 
The  dependence  of  the  fitted  coefficient  A  ~  RE1/3  suggests  that  the  assumed  constant 
value  of  the  kinetic  energy  is  a  function  of  Ri  as  well.  This  relation,  could  not  be  determined 
based  on  the  simple  energy  balance  model.  The  assumptions  made  in  constructing  an  energy 
balance  could  be  verified  with  measurements  of  the  velocity  of  the  fluid  using,  for  example, 
PIV  techniques.  In  current  experiment  these  measurements  were  not  produced,  but  for 
further  progress  there  are  of  vital  importance. 

2.4  Dynamics  of  a  secondary  mixed  layer 

Layers  of  weak  stratification  that  form  below  the  primary  layer  will  be  referred  to  as  the 
secondary  mixed  layers.  Figure  6  shows  an  example  of  a  developed  secondary  mixed  layer 
-  it  has  a  distinct  signature  in  density  profile  as  well  as  in  Brunt-Vaisala  frequency,  where 
bases  of  layers  (interfaces)  have  strong  stratification  and  the  layer  itself  is  weekly  stratified. 
Thus,  on  Figure  6(right)  the  strong  peak1  corresponds  to  the  base  of  a  primary  mixed  layer, 
and  the  peak  with  smaller  magnitude  corresponds  to  the  base  of  a  secondary  mixed  layer. 

1  Buoyancy  frequency  was  obtained  from  smoothed  density  profile  to  remove  noise  -  instrumental  as  well 

as  due  to  small  scale  overturning.  This  leads  to  broader  peaks  with  smaller  magnitudes;  nonetheless,  a  clear 

signature  of  mixed  layers  is  preserved. 


246 
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Brunt-Vaisala  frequency 


Figure  6:  A  typical  density  profile  containing  a  primary  mixed  layer  as  well  as  a  secondary 
one  (left). Brunt-Vaisala  frequency  showing  peaks  at  the  bases  of  mixed  layers. 


Note  that,  secondary  mixed  layer  is  only  partially  mixed  as  apposed  to  the  primary  one 
that  is  completely  destratified.  Thus,  secondary  layers  are  characterized  by  the  magnitude 
of  a  peak  in  buoyancy  frequency  at  the  base  of  the  layer,  and  by  a  layer  thickness,  which  is 
the  distance  between  the  two  peaks. 

As  opposed  to  primary  mixed  layer,  that  monotonically  grows  with  time,  the  secondary 
layer  has  a  very  rich  dynamics.  Figure  7  shows  how  the  buoyancy  frequency  evolves  with 
time.  Here,  there  is  a  clear  indication  of  both  mixed  layers  -  the  primary  one  continuously 
grows  and  the  secondary  one  forms  and  decays  multiple  times  throughout  the  experiment. 
Characteristic  thickness  of  the  secondary  layer  is  of  few  centimeters.  The  evolution  process 
could  be  decomposed  in  several  stages.  In  the  beginning,  there  is  a  strong  turbulent  flow  that 
drives  the  growth  of  the  primary  layer  and  does  not  allow  the  formation  and  persistence  of 
the  fine  scale  secondary  layer.  Then  follows  a  period  of  time  when  the  secondary  layer  forms 
and  decays  with  a  characteristic  time  of  few  tens  of  minutes.  The  layer  forms  spontaneously 
having  a  finite  thickness  which  then  decreases  until  this  layer  merges  with  the  primary  one 
(collision  instability)  and  the  process  repeats.  The  next  stage  is  described  by  a  change  in 
the  way  the  layer  forms.  They  appear  at  the  base  of  a  primary  mixed  layer  having  small 
thickness  and  start  to  grow  building  up  the  density  jump  at  the  interface.  However,  the 
magnitude  of  a  jump  vanishes  with  time  (erosion  instability),  the  layer  disappears  and  the 
next  one  forms.  Finally,  comes  the  stage  when  the  layers  either  no  longer  form  or  stay  in  a 
locked  position  relative  to  the  primary  interface. 

The  evolution  of  the  secondary  mixed  layer  is  highly  dependent  on  the  Richardson 
number  for  the  experiment  and  in  a  current  time.  In  some  cases,  certain  stages  of  the 
evolution  could  be  skipped  which  makes  it  hard  to  classify  all  the  regimes.  Nonetheless,  an 
attempt  to  plot  a  regime  diagram  was  made.  Figure  8  shows  the  stages  of  the  evolution 
of  the  mixed  layer  for  different  experiments  as  a  function  of  non  dimensional  time  and 
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Figure  7:  Examples  of  the  time  evolution  of  the  square  of  the  Brunt-Vaisala  frequency  for 
different  values  of  the  Richardson  number  ( Ri  =  0.54, 1,  2.25, 1.67). 


Figure  8:  Regime  diagram  showing  different  stages  of  evolution  of  the  secondary  mixed 
layer 
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Richardson  number.  Below  the  red  curve  steps  generate  below  the  mixed  layer  and  decay 
by  merging,  whereas  in  between  the  red  and  blue  curves  mixed  layers  generate  from  the 
primary  mixed  layer  and  decay  by  erosion.  Above  the  blue  line  the  staircases  are  either  in 
locked  state  or  do  not  exist.  The  diagram  does  not  show  any  specific  dependence,  however 
this  does  not  imply  that  there  is  no  dependence  whatsoever.  There  is  an  obvious  lack  of 
data:  a  total  of  20  experiments  were  made  and  for  less  then  half  of  them  the  secondary  mixed 
layers  could  be  clearly  classified.  Furthermore,  since  there  are  no  theoretical  predictions  it 
is  not  clear  how  to  present  such  a  diagram:  the  most  obvious  choice  of  presenting  in  terms 
of  Richardson  number  and  time  might  not  be  the  clear  one  to  look  at. 

The  dynamics  of  the  secondary  mixed  layers  is  extremely  complicated.  Understanding  it 
to  a  level  of  making  predictions  requires  a  precise  theoretical  model  bucked  up  by  extensive 
observations  of  not  only  density  but  also  velocity  fields,  turbulent  buoyancy  and  momen¬ 
tum  fluxes.  However,  due  to  time  constrains  these  elaborate  measurements  could  not  be 
performed.  Thus,  the  theoretical  part  of  this  research  would  be  focused  on  explaining  the 
formation  of  secondary  layers  in  an  idealized  phenomenological  framework. 

3  Phenomenological  model 

Secondary  mixed  layers  arise  in  a  stratified  fluid  that  is  subjected  to  the  action  of  a  turbulent 
flow  with  a  mean  shear.  The  turbulent  nature  of  the  flow  would  be  the  key  to  understanding 
the  phenomenon.  The  proposed  mechanism  is  as  follows.  The  sheared  mean  flow  driven 
by  the  rotating  disk  is  a  source  of  turbulent  motion  which  inhomogeneously  distributes 
within  a  stratified  fluid  depending  on  its  stratification.  The  buoyancy  itself  is  affected  by 
turbulent  fluxes.  Thus,  the  formation  on  the  layers  could  be  related  to  the  instability  of 
the  turbulent  flux-buoyancy  relation  as  was  first  proposed  by  Phillips  [7]  and  Posmentier 
[8].  Suppose,  the  evolution  of  buoyancy  b  is  controlled  by  the  divergence  of  the  downward 
turbulent  buoyancy  flux  F: 

db  _  dF 

dt  dz 

and  this  flux  is  a  function  of  the  buoyancy  gradient  F  =  F(bz).  The  equation  has  obvious 
steady  state  solutions  with  constant  flux,  however  such  solutions  would  be  unstable  if  the 
flux  would  be  a  decreasing  function  of  buoyancy  gradient  e.i.  <  0.  Figure  9  shows  the 
mechanisms  of  the  instability.  A  linear  steady  state  buoyancy  profile  (dashed  line)  has  a 
value  of  buoyancy  gradient  that  corresponds  to  the  negative  derivative  of  the  buoyancy  flux. 
After  introducing  a  small  perturbation  (black  line)  there  will  appear  anomalous  buoyancy 
fluxes  (blue  arrows).  In  regions  where  the  perturbations  have  higher  buoyancy  gradient 
the  anomalous  flux  is  negative  and  in  it  is  positive  in  regions  of  lower  gradients.  As  could 
be  seen  from  the  evolution  equation  the  current  distribution  of  the  buoyancy  fluxes  would 
lead  to  a  growth  of  the  initial  perturbation,  meaning  that  the  base  state  is  unstable.  This 
instability  would  lead  to  buoyancy  profiles  consisting  of  a  series  of  mixed  layers  separated 
by  strong  interfaces  (staircases).  In  the  case  of  >  0  the  created  fluxes  would  lead  to  the 
decay  of  the  buoyancy  anomaly  -  this  state  is  stable. 

This  model  considers  the  buoyancy  equation  only  and  assumes  a  particular  shape  of 
the  flux-gradient  curve.  However,  the  exact  form  of  this  curve  would  depend  on  particular 
mixing  mechanisms  that  create  the  turbulent  buoyancy  fluxes.  In  the  following  section 
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Figure  9:  Schematic  representation  of  the  instability  mechanism  that  leads  to  layer  forma¬ 
tion.  Dashed  line  represents  the  initial  buoyancy  profile,  black  line  is  the  slightly  perturbed 
profile,  anomalous  buoyancy  fluxes  denoted  by  blue  arrows. 

a  gradient-type  mixing  model  would  be  devised  that  would  describe  the  development  of 
buoyancy  profiles  under  the  action  of  the  mean  flow  generated  turbulence.  The  formation 
of  the  layers  in  the  model  would  be  related  to  the  mechanism  described  above  by  calculating 
the  exact  form  of  the  flux-gradient  relation.  The  model  would  consist  of  evolution  equations 
written  for  buoyancy  as  well  as  for  turbulent  kinetic  energy  that  is  drained  from  the  mean 
flow.  It  is  aimed  to  describe  the  secondary  mixed  layers  only,  so  it  is  assumed  that  the 
primary  layer  has  been  formulated  and  the  mean  velocity  profile  is  established. 

3.1  Model  formulation 

The  approach  taken  in  constructing  a  mixing  model  is  similar  to  that  implemented  by 
Balmforth  [1]  and  Barenblatt  [2].  The  model  parametrizes  turbulent  vertical  transports 
as  gradient  transports  using  eddy  diffusivity  and  viscosity.  Thus,  the  conservation  laws  for 
horizontally  averaged  buoyancy  b  and  turbulent  kinetic  energy  e  are  as  follows 
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where  S  is  a  source  term  for  the  energy  and  k,  is  the  turbulent  diffusivity.  The  source 
term  represents  production  and  destruction  of  the  turbulent  energy  and  in  this  formulation 
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would  consist  of  four  terms: 
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S  =  —nbz  —  a—  H - kUz2  H - — — U2e1/l2l 
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As  it  was  mentioned  previously,  the  mixing  of  buoyancy  leads  to  an  increase  in  a  gravita¬ 
tional  potential  energy  of  the  system.  Thus,  the  role  of  the  first  term  (— k62)  is  to  convert  the 
eddy  kinetic  energy  into  potential  energy  preventing  the  artificial  accumulation  of  the  total 
energy  E  =  f^(e  —  bz)dz.  The  second  term  —  cc^-  is  the  dissipation,  which  is  on  dimensional 
grounds  proportional  to  the  energy  e  and  to  the  inverse  of  the  eddy  turnover  time  r  ~  Ac/e; 
a  is  a  small  parameter  which  controls  the  intensity  of  the  dissipation  and  effectively  defines 
the  number  of  turnover  periods  over  which  an  eddy  would  dissipate.  The  last  two  terms 
are  the  production  terms  due  to  the  action  of  a  mean  flow.  From  visual  observations  of  the 
experiments  there  seem  to  be  two  mechanism  in  action.  At  low  local  Richardson  numbers 
(Ri  =  N2/Uz2),  where  the  destabilizing  effect  of  the  shear  is  dominating  the  stabilizing 
effect  of  the  stratification,  the  turbulence  is  generated  due  to  Kelvin-Helmholtz  type  insta¬ 
bilities.  These  instabilities,  however,  are  suppressed  at  high  Ri.  Hence  the  representation  of 
the  energy  production  is  of  the  form  ^^nU2.  At  high  Ri  the  turbulence  is  generated  by 
the  mean  flow  acting  on  turbulent  eddies  (vortex  scraping)  by  inducing  a  frictional  stress. 
The  amount  of  power  introduced  by  such  a  mechanism  is  proportional  to  U2el^2l~1 ,  where 
U2  is  scaling  for  the  stress  exerted  by  the  mean  flow,  e1/2  is  a  characteristic  velocity  of  an 
eddy  and  l  is  its  length  scale.  Factors  1^j  and  are  chosen  to  ensure  that  the  two 
mechanisms  would  act  in  their  corresponding  regime  of  Richardson  numbers. 

Eddy  diffusivity,  as  well  as  viscosity,  is2  proportional  to  le1^2.  To  close  the  system  the 
characteristic  eddy  size  l  needs  to  be  defined.  Since  the  model  aims  to  describe  mixing  in  the 
stratified  fluid  there  are  several  length  scales  involved.  In  the  limit  of  strong  stratification 
the  size  of  eddies  would  be  proportional  to  the  Ozmidov’s  length  scale  (e/bz)1/2.  This 
restriction  comes  from  the  fact  that  eddies  overturn  density  layers  leading  to  increase  in 
potential  energy.  Thus,  for  a  given  energy  of  an  eddy  and  the  ambient  stratification  there 
would  be  a  restriction  on  its  size.  In  the  limit  of  weak  stratification,  the  characteristic 
length  scale  for  an  eddy  l  =  d  should  be  related  to  the  geometrical  parameters  e.i.  the  size 
of  the  tank3.  Thus,  to  interpolate  the  two  length  scales  in  the  limits  of  strong  and  weak 
stratifications  expression  for  the  eddy  length  scale  l  is  chosen  to  be 


l 2  d2  '  e/bz 


(4) 


where  7  is  a  parameter  that  determines  the  length  scale  in  the  limit  of  strong  stratification. 
It  should  be  pointed  out  that  the  model  is  written  in  terms  of  locally  defined  dynamical 
variables,  meaning  that  all  the  scales  are  controlled  by  the  local  dynamics  and  do  not  depend 
on  global  profiles  of  the  variables.  However,  this  doesn’t  always  take  place  in  reality.  For 
example,  the  model  used  for  the  eddy  length  scale  states  that  in  the  case  of  locally  weak 
stratification  the  length  should  be  equal  to  d  -  some  geometrical  restriction.  However, 
vertical  profiles  of  stratification  containing  mixed  layers  bounded  by  strong  stratification 

2  For  simplicity  eddy  viscosity  and  diffusivity  are  taken  to  be  the  same,  however  this  assumption  could 
be  relaxed 

3A  restriction  of  the  length  scale  in  weakly  stratified  case  avoids  singularities  in  the  model. 
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would  impose  an  equivalent  ’geometrical’  constrain  on  an  eddy  size  in  the  location  within 
the  mixed  layer  -  the  eddy  size  could  not  be  greater  than  the  thickness  of  a  mixed  layer  with 
strong  interfaces.  Thus,  the  parameter  d  should  be  considered  as  effective  one  that  takes 
into  account  geometrical  constrains  as  well  as  the  global  buoyancy  profile.  In  the  current 
model,  for  simplicity  it  is  assumed  to  be  constant. 

A  resulting  model  is  effectively  a  set  of  coupled  nonlinear  diffusion  equations,  which 
require  boundary  and  initial  conditions  to  be  completed.  As  boundary  conditions,  the 
fluxes  of  energy  and  buoyancy  are  set  to  be  zero  at  the  top  and  the  bottom  boundaries  of 
the  domain  e.i.  ez  =  0,  bz  =  0  at  z  =  0,  H.  The  initial  conditions  would  be  discussed  in  a 
section  3.3.  Thus,  the  complete  equation  set  of  the  mixing  model  is  the  following: 
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where  a,  7,  d  are  fixed  parameters;  characteristic  velocity  U  and  velocity  shear  Uz  rep¬ 
resent  the  forcing  for  this  system  and  assumed  to  be  constant.  In  principle,  equations  6 
to  9  could  be  combined  to  give  an  evolution  equation  for  the  turbulent  diffusivity  k  -  the 
variable  that  determines  the  evolution  of  the  buoyancy  profile. 

Since  the  purpose  of  the  model  is  to  describe  the  formation  of  mixed  layers  it  is  conve¬ 
nient  to  use  the  buoyancy  gradient  (g  =  bz)  instead  of  buoyancy  as  a  dynamical  variable. 
The  mixed  layers  would  have  a  distinct  signature  in  buoyancy  gradient  profile:  low  val¬ 
ues  within  the  mixed  layer  and  sharp  peaks  at  the  interfaces.  Further,  it  is  useful  to  use 
non-dimensional  equation  set  with  the  following  definition  of  dimensionless  quantities: 
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In  dimensionless  form  the  equations  are 
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:  ez  =  0,  g  =  0  at  z  =  0,  H 

(14) 

where  a  parameter  Q  describes  the  relative  importance  of  the  mean  flow  and  the  mean 
shear  in  the  energy  production  term;  the  local  Richardson  number  is  proportional  to  the 
buoyancy  gradient  only  (Ri  =  g/ 7). 
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Figure  10:  Steady  state  buoyancy  flux  plotted  as  a  function  of  the  buoyancy  gradient  for 
different  values  of  the  forcing  parameter  Q. 


3.2  Steady  states  and  their  stability 

Consider  a  steady  state  of  the  model,  which  has  a  constant  energy  and  buoyancy  gradient 
(eo>5o)  =  const.  It  violates  the  boundary  conditions,  nonetheless  away  from  the  influence 
of  the  boundaries  it  is  useful  to  consider  its  stability.  Such  steady  states  should  have  no 
production  of  turbulent  kinetic  energy  (S  =  0)  in  order  for  dynamic  variables  not  to  change 
with  time: 


e20  ,  7«o  +  Qg0 

-n0go  -  ay - 1 - — - f—  =  0 

« o  1  +  50/7 


(15) 

(16) 


This  sets  a  constrain,  which  effectively  defines  energy  and  buoyancy  flux  as  a  function 
of  buoyancy  gradient  (or  Richardson  number)  for  the  considered  steady  state  solutions: 


e0  =  E(g0)  (17) 

«o5o  =  F(go)  (18) 

As  was  discussed  earlier,  the  instabilities  would  develop  if  the  initial  profile  has  the 

Richardson  number  which  is  within  the  range  of  the  negative  slope  of  the  buoyancy  flux. 

Figure  10  shows  the  steady  state  buoyancy  flux  as  a  function  of  Richardson  number  (Ri  = 
g/ 7).  The  numerical  values  of  parameters  are  chosen  as  a  =  0.1, 7  =  0.2.  The  representative 
mean  flow  parameter  Q  =  0.8,1.5,2.56.  Curves  with  values  of  Q  <  2.56  show  a  non¬ 
monotonic  behavior,  having  a  region  where  the  derivative  of  the  flux  is  negative.  At  low 
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values  of  Ri,  the  instabilities  are  not  possible  due  to  extremely  high  turbulent  energy  created 
by  shear  instabilities  that  are  not  suppressed  by  the  stratification.  At  high  Richardson 
numbers  the  buoyancy  flux  is  controlled  by  the  turbulence  produced  by  the  mean  flow 
which  is  proportional  to  g 4/2,  hence  the  increase  of  the  buoyancy  flux  with  the  increase  in 
stratification.  However,  the  shear  production  term  is  ~  e/53/2,  which  means  that  there  is 
strong  penalty  in  energy  production  for  an  increase  in  stratification  -  hence  the  turbulent 
flux  due  to  shear  production  should  decrease  with  the  stratification  at  high  Richardson 
numbers.  Thus,  the  effect  of  the  mean  shear  tends  to  destabilize  the  system  and  create 
mixed  layers,  whereas  the  mean  flow  turbulence  has  a  stabilizing  effect.  Hence,  in  the 
intermediate  range  of  Richardson  the  slope  has  a  negative  slope  due  to  shear  production 
term,  however  this  region  is  of  finite  size  due  to  the  action  of  the  mean  flow  turbulence  at 
higher  values  of  Ri.  This  means  that  the  instability  could  not  grow  unbounded  in  time:  at 
some  point  the  effect  of  the  mean  flow  turbulence  would  suppress  the  growth.  Thus,  the 
existence  and  development  of  instabilities  depend  crucially  on  the  relative  importance  of 
the  mean  flow  to  shear  production  terms  which  is  determined  by  the  value  of  the  parameter 
Q.  If  the  forcing  Q  is  greater  than  2.56  the  flux- gradient  curve  monotonically  increases  with 
the  value  of  Richardson  number  -  stabilizing  effect  of  the  mean  flow  is  dominant.  For  such 
forcing  all  the  buoyancy  profiles  would  be  stable. 

The  described  instabilities  could  now  be  identified  from  the  formulated  model  equations 
and  their  growth  rates  could  be  quantified.  Linearizing  the  equations  around  the  base  state 
(eo;5o)  and  assuming  exponentially  growing  solutions  for  perturbations  (e,g)  ~  exp(Xt  — 
ikz )  leads  to  a  linear  stability  problem: 
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where  f  =  Kg  is  the  buoyancy  flux,  S  is  the  source  term  in  the  energy  equation  and  all  the 
partial  derivatives  are  calculated  at  the  point  (eo,  go)-  Thus,  for  a  given  value  of  parameter  Q 
and  initial  value  of  the  Richardson  number  a  growth  rate  A  could  be  calculated  as  a  function 
of  the  wavenumber  k.  Figure  11  shows  growth  rates  calculated  for  Q  =  1.5  and  for  values 
of  base  state  Ri  =  0.3, 0.7, 1.3.  The  growth  rate  is  positive  for  Ri  =  0.7  which  is  located  at 
the  negatively  sloped  part  of  the  flux  curve  (Figure  10).  The  growth  rate  is  negative  for  the 
values  of  Richardson  numbers  of  0.3  and  1.3  that  belong  to  the  monotonically  increasing 
part  of  the  slope  -  these  are  stable  solutions.  An  important  property  of  the  written  model  is 
that  in  the  case  of  the  unstable  steady  states  the  growth  rate  of  the  perturbations  has  has 
a  high  wavenumber  cutoff4  which  means  that  small  scale  perturbations  do  not  grow  with 
time  -  instead,  they  decay.  This  property  allows  to  perform  direct  numerical  simulations 
of  the  development  of  the  instabilities  and  their  evolution  leading  to  the  formation  of  the 
mixed  layers. 


3.3  Numerical  simulations 

Numerical  simulations  of  the  mixing  model  were  performed  using  first  order  in  time  and 
second  order  in  space  numerical  scheme  with  an  adaptive  time  stepping.  Computational 
domain  is  z  =  [0, 1200]  having  600  uniformly  distributed  grid  points.  Initial  conditions  for 

4The  high  wavenumber  cutoff  is  due  to  the  diffusion  of  the  turbulent  kinetic  energy 
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Figure  11:  Growth  rate  plotted  as  a  function  of  the  wavenumber  for  different  values  of  the 
base  state  Richardson  numbers  and  for  a  fixed  parameter  Q  =  1.5  (as  predicted  by  the 
linear  stability  analysis). 


Figure  12:  Numerically  simulated  evolution  of  the  instabilities  manifested  in  the  vertical 
profiles  of  the  Richardson  number. 
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Figure  13:  A  snapshot  of  model  variables  at  t  =  le4:  Richardson  number  (left),  buoyancy 
flux  (middle)  and  turbulent  kinetic  energy  (right). 


the  numerical  simulation  are  chosen  to  be  Ri  =  0.8  for  the  buoyancy  profile5,  e  =  0.01  for 
the  turbulent  kinetic  energy  and  the  forcing  parameter  Q  =  1.  Based  on  a  linear  stability 
analysis  such  a  configuration  should  be  unstable.  Indeed,  the  numerical  simulations  confirm 
it.  Figure  12  shows  snapshots  of  the  modeled  buoyancy  gradient  at  different  time  -  it  is 
clearly  seen  that  the  instabilities  are  developing  in  the  form  of  sharp  peaks  in  buoyancy 
gradient  profiles.  First,  the  instabilities  develop  close  to  boundaries  because  there  are  larger 
perturbation  from  the  steady  state  their.  Then,  they  appear  in  the  interior  and  eventually 
fill  the  whole  domain  except  for  small  regions  adjacent  to  boundaries  which  are  affected  by 
zero  flux  boundary  conditions. 

Figure  13  shows  a  snapshot  of  the  buoyancy  gradient  g,  turbulent  kinetic  energy  e  and 
a  corresponding  buoyancy  flux  /.  The  buoyancy  flux  seems  to  be  relatively  constant  in 
the  interior  region,  however  having  isolated  regions  of  low  flux.  The  energy  distribution  is 
higher  in  the  regions  of  weak  stratification  and  has  minimums  at  within  highly  stratified 
interfaces.  The  instabilities  develop  having  a  wavelength  corresponding  to  the  one  that 
has  a  maximum  growth  rate  predicted  by  a  linear  stability  analysis.  However,  it  has  been 
observed  that  after  the  initial  development  stage  the  structure  tends  to  coarsen  with  time 
by  merging  or  decaying  mechanisms.  The  individual  peaks  are  quasi-steady  solutions  of  the 
equations  and  evolve  by  weakly  interacting  with  each  other.  The  result  of  the  interaction 
could  be  considered  as  a  secondary  instability  that  leads  to  the  coarsening  through  erosion 
or  collision  of  the  adjacent  interfaces  [9].  The  time  scales  for  the  two  instabilities  would 
depend  on  the  base. 

Figure  14  shows  the  time  evolution  of  the  buoyancy  gradient  and  the  buoyancy  flux 
throughout  the  whole  simulation.  The  initial  development  stage  lasts  till  the  time  t  ~  4000 
and  is  characterized  by  the  formation  of  the  peaks  in  buoyancy  gradient.  The  overall 
structure,  however,  tends  to  coarsen.  The  peaks  either  merge  with  each  other  creating  a 

5To  satisfy  initial  conditions  at  the  boundaries  the  distribution  of  Ri  was  chosen  such  that  it  is  constant 
within  the  interior  of  the  domain  and  continuously  approaches  zero  at  the  boundaries. 
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Figure  14:  Time  evolution  of  the  modeled  vertical  profiles  of  the  Richardson  number  (left) 
and  the  buoyancy  flux  (right). 


higher  and  broader  peaks,  or  they  decay  in  amplitude  and  disappear  amplifying  the  adjacent 
peaks(  Figure  14(top)  ).  The  flux  on  the  other  hand,  started  from  an  initial  condition  which 
has  a  weak  constant  value  within  the  interior  and  after  the  instabilities  have  developed  its 
value  also  remained  constant,  however  at  a  higher  value  (Figure  14(bottom)).  Note  that 
the  merging  and  decaying  events  are  accompanied  by  a  transient  decrease  in  buoyancy 
flux.  This  is  clearly  seen  at  Figurel3,  where,  for  example,  the  two  peaks  at  z  ~  550 
or  z  ~  750  are  about  to  merge  and  the  corresponding  values  of  the  buoyancy  flux  are 
decreased.  The  edge  regions  which  are  characterized  by  weak  stratification  and  buoyancy 
flux  are  continuously  growing  free  of  instabilities.  The  end  result  of  such  a  simulation  would 
be  the  full  homogenization  of  the  buoyancy,  since  system  is  forced  by  the  mean  flow  which 
is  a  continuous  source  of  turbulent  mixing. 

The  numerical  simulation,  showed  that  a  series  of  mixed  layers  could  form  in  a  region 
subjected  to  the  action  of  the  mean  flow  turbulence.  However,  the  simulation  was  performed 
in  an  idealized  setting  so  that  it  could  confirm  the  results  of  the  linear  analysis  of  the  stability 
of  the  model.  The  model  only  conceptually  shows  the  mechanism  of  layer  formation.  In 
reality,  there  are  many  more  factors  that  would  contribute  to  the  evolution  of  the  layers. 
First  of  all,  the  mixing  model  uses  constant  characteristic  shear  and  the  mean  flow  as  a 
mechanisms  for  the  generation  of  turbulence.  However,  the  mean  flow  is  affected  by  changes 
in  stratification  and  should  be  accounted  by  the  model  if  the  instabilities  develop  at  a  similar 
time  scale  as  the  variations  in  mean  flow.  Furthermore,  in  the  experiments  there  is  a 
nonuniform  buoyancy  gradient  profile,  as  it  has  a  dominant  peak  at  the  base  of  the  primary 
mixed  layer.  The  existence  of  this  layer  affects  the  evolution  of  the  adjacent  secondary 
mixed  layers.  Nonetheless,  the  model  shows  the  important  features  of  the  evolution  of  the 
secondary  mixed  layers:  spontaneous  generation  of  the  layers,  the  merging  of  the  layers  and 
their  decay  -  all  of  which  have  been  observed  in  the  experiments.  Even  though  the  proposed 
model  simulates  similar  behavior  to  the  observations,  the  assumption  of  the  model  should 
be  verified  experimentally  -  this  would  require  elaborate  measurements  of  the  mean  flow  as 
well  as  the  of  the  turbulent  buoyancy  and  momentum  fluxes. 
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4  Conclusions 


A  series  of  experiments  were  performed  on  a  linearly  stratified  fluid  subjected  to  the  surface 
frictional  stress  induced  by  a  rotating  disk.  The  control  parameters  for  the  experiments  were 
the  magnitude  of  the  stratification  ( N 2)  and  the  rotating  speed  of  the  disk  (O)  resulting 
in  a  single  non-dimensional  parameter  Ri  =  N'2 /O2.  Two  types  of  mixed  layers  have  been 
observed  -  the  primary  layer  extending  from  the  surface  into  the  interior  of  the  fluid,  and 
the  secondary  mixed  layer  of  much  smaller  scale  that  forms  underneath  it.  The  growth 
of  the  primary  layer  with  time  seems  to  obey  a  power  law  h  ~  .  This  result  is 

consistent  with  the  energy  balance  model  that  assumes  a  constant  kinetic  energy  withing 
the  layer. 

The  secondary  mixed  layers  show  a  complicated  behavior,  having  several  regimes  de¬ 
pending  on  time  and  the  Richardson  number  of  the  experiment.  A  physical  mechanism 
for  the  formation  of  the  secondary  mixed  layers  has  not  been  clearly  identified  from  the 
observations  due  to  the  lack  of  observational  data,  namely  the  turbulent  fluxes  and  the 
mean  flow.  Nonetheless,  the  hypothesis  has  been  put  forward  that  relates  the  formation 
of  these  layers  to  an  instability  of  the  flux-gradient  type  that  could  arise  in  stratified  tur¬ 
bulent  flows.  A  one  dimensional  mixing  model  has  been  devised  and  the  instabilities  were 
investigated  theoretically  as  well  as  through  numerical  simulations.  The  resulting  mixed 
layers  form  only  in  a  particular  regime  that  puts  constraints  on  the  magnitude  of  the  mean 
flow  with  respect  the  its  shear  and  the  Richardson  number  for  the  profile.  Furthermore, 
the  dynamics  predicts  the  destruction  of  the  mixed  layers  through  merging  and  decay  -  the 
precesses  observed  in  experiments. 

Based  on  the  mixing  model,  the  results  of  the  experiments  could  be  interpreted  as 
follows.  Supported  by  the  disk  generated  turbulence,  the  primary  mixed  layer  grows  creating 
a  large  density  gradient  at  the  base  of  it.  At  the  same  time,  the  mean  flow  is  established 
within  the  primary  layer.  The  flow  is  suppressed  as  it  reaches  high  stratification  and  thus 
the  shear  is  created  at  the  base  of  the  mixed  layer.  This  sets  a  particular  value  of  the  model 
parameter  Q,  which  reflects  the  relative  importance  of  the  mean  flow  to  the  mean  shear. 
The  magnitude  of  the  flow  as  well  as  of  its  shear  changes  with  time  as  the  mixed  layer  grows, 
meaning  that  Q  is  a  function  of  time  as  well  as  space.  The  formation  of  the  secondary  mixed 
layers  below  the  primary  is  observed  when  the  necessary  conditions  for  the  instabilities  are 
met:  the  parameter  Q  is  smaller  then  the  critical  value,  and  the  local  Richardson  numbers 
belong  to  the  unstable  part  of  the  flux-gradient  curve.  Such  a  constrain  identifies  a  confined 
region  in  a  fluid  where  the  instabilities  could  occur.  The  instabilities  within  this  region  lead 
to  the  development  of  the  layers,  however  the  magnitude  of  the  instabilities  is  suppressed 
by  the  stabilizing  effect  of  the  mean  flow  turbulence.  That  explains  why  secondary  mixed 
layers  are  not  completely  destratified.  After  the  spontaneous  development  of  the  layers, 
comes  the  stage  of  slow  evolution  when  the  layers  either  merge  with  the  primary  interface 
of  decay  by  erosion  -  consistent  with  the  behavior  of  the  observed  layers.  The  change  in  the 
regime  is  the  shift  from  the  dominating  collision  instability  to  the  erosion  instability  which 
is  associated  with  the  change  in  the  base  state. 
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Abstract 

On-off  intermittency  refers  to  the  special  pattern  of  time  series  which 
experience  long  time  periods  of  calm  behavior  followed  by  short  periods  at 
bursting.  In  particular,  the  Ricker  Map,  which  is  a  model  of  the  discrete 
evolution  of  the  expected  population  of  one  species  at  a  given  generation, 
is  implemented  to  generate  on-off  intermittency.  The  on-off  intermittency 
generated  by  the  Ricker  Map  also  exhibits  the  main  characteristics  found 
in  other  dynamical  systems.  On-off  intermittency  in  a  locally  connected 
Ricker  Map  is  the  main  focus  here.  The  signal  transfer  between  adja¬ 
cent  maps  changes  the  stability  condition  compared  with  that  of  a  single 
map,  which  leads  to  different  conditions  for  on-off  intermittency  in  locally 
coupled  Ricker  maps.  The  evolution  and  signal  transfer  in  locally  con¬ 
nected  maps  is  carefully  studied  using  a  simple  continuous  model  which 
contains  similar  structure  and  generates  on-off  intermittency.  Through 
this  analysis,  it  is  found  that  a  specific  map  in  a  locally  connected  struc¬ 
ture  recieves  positively  correlated  signals  from  the  adjacent  maps,  which 
generates  more  on-stages  compared  with  a  single  map  which  has  the  same 
parameters. 


1  Introduction 

A  signal  representing  on-off  intermittency  exhibits  two  different  states.  One  is 
an  ’off’  states,  where  the  variable  remain  almost  constant.  The  other  is  an  ’on’ 
state  which  is  a  short  time  period  of  sudden  bursting  in  the  time  series.  This 
on-off  intermittency  has  attracted  wide  interest  due  to  the  potential  that  many 
phenomena  are  related  to  on-off  intermittency.  Examples  include  the  solar  cycle 
showing  long  time  periods  of  calm  interrupted  by  short  bursting(Platt  et  al. 
1993a),  and  earthquake  occurrence(Bottiglieri  &  Godano  2007). 

Suppose  that  there  is  a  /c-dimensional  invariant  manifold  in  an  n-dimensional 
dynamical  system,  where  n  >  k.  The  invariant  manifold  has  a  chaotic  attractor 
whose  stability  can  be  controlled  by  a  parameter,  p.  If  p  varies  across  a  thresh¬ 
old,  the  distance  between  the  trajectory  and  the  invariant  manifold  remains  very 
close  to  0  for  a  long  time  and  shows  a  sudden  increase  for  a  short  duration.  In 
this  context,  on-off  intermittency  can  be  defined(Platt  et  al.  19936).  This  argu¬ 
ment  is  discussed  by  considering  the  skew  product  structure  in  the  dynamical 
system.  Though  the  skew  structure  is  not  necessary  for  on-off  intermittency, 
it  facilitates  the  analysis  of  on-off  intermittency.  Let  us  consider  the  following 
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equations  : 


X  =  F(X,fi(t)), 

Y  =  G(Y,  v o),  and 
H(t)  =  M(no,Y(t)), 


(1) 

(2) 

(3) 


wherein  the  parameter  /i  for  the  evolution  of  X  is  controlled  by  Y .  In  this 
system,  if  Y  is  responsible  for  the  parameter  /i  traveling  above  and  below  a 
threshold  with  time,  one  could  observe  on-off  intermittency  in  the  time  series 
for  X.  Consider  an  even  simpler  case  of  the  logistic  map  for  time  evolution  of 
X  represented  as  Xt+i  =  rtXt(  1  —  Xt).  Here,  rt  is  the  parameter  determining 
the  stability  of  the  invariant  manifold  Xt  =  0  and  hence  rt  =  Aq  +  cr^,  where 
Aq  is  the  average  of  rt  and  is  uniformly  distributed  noise  between  0  and  1. 
Depending  on  the  choice  of  Ao  and  <r1  we  can  generate  an  on-off  intermittency 
signal  in  X(Toniolo  et  al.  2002). 

This  simple  setting  for  on-off  intermittency  reveals  several  important  char¬ 
acteristics  of  an  on-off  intermittency  signal.  Heagy  et  al.  (1994)  used  uniformly 
distributed  noise  for  driving  a  parameter  to  investigate  the  statistics  of  the  du¬ 
ration  of  off-stages  in  on-off  intermittency.  They  found  that  the  probability 
density  function  <j>  of  the  duration  of  off-stages  is  proportional  to  (f>~  2 ,  which  is 
considered  as  the  distinguishing  characteristic  of  on-off  intermittency  relative  to 
other  non-intermittent  signals  or  other  intermittency  signals.  They  also  found 
numerically  that  the  probability  density  function  does  not  depend  on  the  choice 
of  the  driving  parameters,  which  hints  of  the  possibility  that  the  —3/2-law  could 
be  general  for  on-off  intermittency  generated  by  various  dynamical  structures. 
Toniolo  et  al.  (2002)  sought  out  various  characteristics  of  on-off  intermittency 
using  power  spectrum  analysis  and  found  a  clear  negative  slope  in  a  log-log  scale 
plot.  These  results  can  be  used  to  identify  the  on-off  intermittency  in  the  time 
series  obtained  from  observation  or  experiments.  For  example,  the  time  series  of 
the  earth  quake  occurence  shows  the  —3/2-law  in  the  probability  density  func¬ 
tion  of  the  duration  of  off-stages  (Bottiglieri  &  Godano  2007).  Even  if  we  do 
not  have  an  exact  model  for  earthquake  justifiably,  we  can  speculate  that  the 
occurrence  of  earthquakes  shows  on-off  intermittency. 

This  previous  work  opens  the  possibility  that  we  can  determine  whether  or 
not  a  signal  observed  exhibits  on-off  intermittency.  However,  the  previous  work 
used  a  single  map  to  investigate  the  characteristics  of  on-off  intermittency.  It  is 
not  realistic  that  a  single  dynamical  system  is  totally  isolated  from  other  nearby 
systems.  For  example,  convection  cells  located  in  deep  clouds  can  interact  with 
neighboring  cells,  or  the  population  of  insects  or  fish  at  one  location  can  be 
related  to  those  at  another  location  through  migration  or  sharing  of  resources 
given  by  same  environment.  Considering  the  possible  coupling  with  neighbors 
in  realistic  situations,  it  is  worthwhile  to  investigate  a  simple  form  of  interaction 
between  dynamical  systems  under  conditions  for  on-off  intermittency. 

In  this  report,  we  will  use  the  Ricker  Map,  which  is  a  generalized  version 
of  the  logistic  map  for  population  dynamics.  First,  we  will  check  whether  the 
Ricker  Map  also  contains  on-off  intermittency  in  trajectories  that  travel  near 
an  invariant  manifold.  We  will  also  examine  whether  there  is  consistency  in  the 
characteristics  of  on-off  intermittency  that  have  been  identified  using  different 
maps.  Second,  simple  local  coupling  will  be  defined  for  the  investigation  of  the 
role  of  the  interaction.  Based  on  the  definition  of  local  coupling,  we  will  try  to 
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find  out  how  the  signals  in  the  local  coupling  structure  evolve  and  transfer  to 
adjacent  maps.  Numerical  simulation  and  a  possible  theoretical  approach  will 
be  provided. 


2  Single  Ricker  Map 

The  Ricker  Map  (Ricker  1954)  was  first  suggested  by  Bill  Ricker  to  study  the 
expected  population  of  one  specific  species  at  a  given  generation.  The  Ricker 
Map  is  represented  by  a  single  discrete  dynamical  model  in  which  the  popula¬ 
tion  at  the  generation  t  +  1  is  determined  by  the  previous  generation  t  with  a 
prescribed  growth  rate  and  restriction  given  by  environmental  conditions  like 
food,  habitat  and  water.  The  Ricker  Map  is  represented  as 

Nt+1  =Nter(1~^\  (4) 

where  Nt  is  the  population  at  t  and  r  is  the  growth  rate.  Th  carrying  capacity 
K  represents  the  population  size  of  the  species  that  the  environment  can  hold 
depending  on  resources  like  food  or  water.  For  simplicity  in  this  study,  K  is 
assumed  to  be  equal  to  be  1.0  and  hence  (4)  becomes 

Nt+1  =  (5) 


Figure  1:  Bifurcation  diagram  for  Ricker  Map.  If  r  <  2.0,  Nt  converges  to  1.0. 
But  if  r  >  2.0,  Nt  oscillates  among  several  values,  which  demonstrates  period- 
doubling  bifurcation.  At  r  =  2.0,  one  solution  divides  into  two  branches.  Near 
r  =  2.5,  each  solution  bifurcates  again  and  hence  Nt  travels  around  four  values. 
As  r  increases,  the  periodic-doubling  bifurcations  continue  and  the  number  of 
solutions  grows  dramatically. 

This  discrete  map  shows  different  sequences  of  Nt  depending  on  the  value 
of  r.  If  r  is  less  than  2.0,  Nt  converges  to  1.0.  But  if  r  is  greater  than  2.0,  Nt 
starts  to  oscillate  among  several  values.  This  situation  is  well  displayed  by  a 
bifurcation  diagram.  Figure  1  shows  the  steady  state  solutions  as  a  function  of 
the  value  of  r.  When  r  is  equal  to  2.0,  one  solution  divides  into  two  branches 
which  is  called  a  period-doubling  bifurcation.  Then  near  r  =  2.5,  each  solution 
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bifurcates  again  giving  four  solutions.  As  r  increases  further  then,  the  period¬ 
doubling  bifurcations  continue  and  the  number  of  solutions  grows  dramatically, 
which  is  shown  in  black  in  figure  1.  Based  on  the  bifurcation  diagram,  we  can 
see  that  Nt  =  1.0  is  the  invariant  manifold  whose  stability  is  determined  by  r. 
If  r  is  larger  than  2.0,  Nt  =  1.0  becomes  unstable.  On-off  intermittency  can 
be  generated  when  r  moves  across  2.0  in  time.  Under  the  assumption  that  r 
is  independent  of  Nt  (the  skew  product  structure),  we  have  to  choose  a  way  of 
determining  r  at  each  time  step.  Heagy  et  al.  (1994)  and  Toniolo  et  al.  (2002) ’s 
works  indicate  that  the  method  of  determining  r  does  not  influence  on-off  inter¬ 
mittency.  However,  without  specific  physical  constraints,  we  choose  Gaussian 
white  noise  as  our  driving  method  with  appeal  to  the  central  limit  theorem.  In 
realistic  cases,  r  can  be  affected  by  various  physical  variables.  The  dimension 
of  the  dynamical  system  for  r  could  be  large.  Therefore,  many  physical  or  en¬ 
vironmental  processes  that  evolve  over  a  range  of  frequencies  influence  the  time 
dependence  of  r.  We  can,  therefore,  think  of  the  change  of  r  in  the  context  of 
Brownian  motion  as  r  =  rg  +  er£,  where  r0  is  the  mean  of  r,  £  the  Gaussian 
white  noise  whose  mean  and  standard  deviation  are  0  and  1.0,  respectively,  and 
a  the  intensity  of  noise.  Hence  the  parameter  r  is  controlled  by  two  variables, 
ro  and  a.  We  can  ask  if  on-off  intermittency  is  always  seen  in  \Nt  —  1.0|  in  the 
parameter  space,  (ro ,  a) ,  when  r  travels  across  2.0. 

Figure  2  shows  time  series  with  different  ro  and  a.  For  (a)  and  (c),  we  choose 
ro  =  2.1,  so  that  ro  is  in  the  parameter  region  where  Nt  =  1.0  is  unstable. 
Then  r  is  driven  by  Gaussian  white  noise  whose  amplitude  is  a.  In  (a),  a  is 
chosen  to  be  0.2,  which  does  not  produce  an  intermittent  signal  even  though 
r  moves  through  2.0.  The  signal  shown  in  figure  2(a)  is  similar  to  a  noisy 
chaotic  signal.  When  er  is  increased  to  0.4  in  (c),  the  signal  shows  clear  bursting 
after  long  calm  period,  which  can  be  recognized  as  on-off  intermittency.  On-off 
intermittency  is  generated  through  a  particular  perturbation  of  r.  Figure  2(b) 
shows  an  intermittent  signal  where  r  =  7.0  is  chosen  to  be  constant,  but  it  is  not 
on-off  intermittency.  On-off  intermittency  is  generated  when  the  perturbation 
of  r  changes  the  stability  of  the  fixed  point  at  each  time  step.  The  distinction 
between  the  intermittency  shown  in  (b)  and  the  on-off  intermittency  shown 
in  (c)  cannot  be  understood  without  information  about  r.  We  need  further 
analyses  to  make  a  clear  distinction. 

According  to  figure  2,  on-off  intermittency  emerges  only  for  certain  choices 
of  r0  and  a.  Metta  et  al.  (2010)  suggested  a  method  to  find  on-off  intermittency 
by  first  computing  the  stability  curve  in  a  parameter  space  which  consists  of 
the  mean  value  of  the  parameter  and  the  intensity  of  the  noise.  They  then  used 
kurtosis  to  locate  on-off  intermittency  in  the  parameter  space.  Larger  values  of 
kurtosis  represent  a  signal  that  can  be  considered  as  on-off  intermittency.  We 
can  follow  the  procedure  here.  First,  we  can  think  of  the  stability  of  Nt  as  a 
function  of  Tq  and  cr.  A  necessary  condition  for  on-off  intermittency  is  that  r 0 
and  cr  must  be  chosen  to  lie  inside  the  unstable  region  in  the  parameter  space, 
(ro,cr).  Unless  it  is  unstable,  the  time  series  converges  to  a  fixed  point  so  that 
the  emergence  of  bursting  is  unlikely. 

Consider  the  linearized  equation  near  the  fixed  point,  Nt  =  1.0,  for  which 
we  can  let  Nt  =  r/t.  +  1.  After  ignoring  higher  order  terms,  this  gives 

\Vt+l\  =  |l-r0-cr£t||f?f|.  (6) 
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Figure  2:  Sample  time  series  of  the  driven  variable  Nt  in  the  different  r o  and  a. 
(a)  r0  =  2.1,  a-  =  0.2;  (b)  r0  =  7.0,  a  =  0.0;  (c)  r0  =  2.1,  a  =  0.4.  (a)  shows 
noisy  chaotic  signal,  which  is  non-intermittent  signal,  (b)  is  a  kind  of  intermit¬ 
tent  signal  but  not  on-off  intermittency.  (c)  shows  an  on-off  intermittency. 


From  this  relationship  between  Nt+i  and  Nt,  we  can  determine  |?yt|  as 


t-i 

\Vt\  =  II  I1  “  ro  -  a£k\\Vo\-  (7) 

k= 0 

Taking  the  logarithm  of  both  sides,  the  equation  becomes 


t- 1 

log|%|  =  5Z1°sl1  ~ro-  a€k\  +log|?7o| 
k= 1 

=  t  <  log|l  -  r0  -  cr£fc|  >  +log|j7o|.  (8) 

Here  <  •  >  represents  the  ensemble  average  and  <  log 1 1  —  r0  —  a£k  \  >  can  be 
considered  as  the  exponent  of  \rjt\-  Therefore,  the  stability  curve  satisfies 

<  log|l  -  r0  -  cr£fc|  >  =  0.  (9) 

The  calculation  of  the  ensemble  average  can  be  accomplished  using  the  known 
probability  density  function  for 

/OO 

p(z)log|l  -  r0  -  az\dz,  (10) 

-CO 

where  p(z)  is  the  normal  distribution  whose  mean  and  standard  deviation  are 
0.0  and  1.0,  respectively. 

Figure  3(a)  shows  the  stability  curve  in  (ro,a)  parameter  space,  calculated 
using  the  equation  (10).  In  the  region  bounded  by  the  ro  axis  and  the  a  axis 
and  the  stability  curve,  all  time  series  converge  to  the  fixed  point  1.0.  Inside 
this  region,  it  is  impossible  to  find  on-off  intermittency.  On-off  intermittency 
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Figure  3:  Stability  curve  and  kurtosis  at  each  r0  and  a.  (a)  The  stability  curve 
is  calculated  using  <  log |1  —  ro  —  <r£fc |  >  =  0.0.  (b)  At  each  ro  and  cr,  kurtosis 
is  calculated  after  generating  the  time  series. 


only  exists  outside  of  this  region.  To  find  the  on-off  intermittency  outside  of  this 
stable  region,  kurtosis  is  a  useful  measure.  Kurtosis  is  the  statistical  quantity 
used  to  detect  infrequent  extreme  deviations.  Higher  kurtosis  in  this  parameter 
space  representative  of  on-off  intermittency.  In  figure  3(b),  we  can  see  higher 
values  of  kurtosis  in  the  area  very  close  to  stability  curve.  As  suggested  by 
previous  research  (Metta  et  al.  2010),  there  is  on-off  intermittency  in  the  Ricker 
Map  of  the  unstable  region  in  parameter  space  and  very  near  the  stability  curve. 

In  the  Ricker  Map,  we  have  information  about  the  parameters,  ro  and  cr, 
which  can  lead  to  an  on-off  intermittency  signal.  Without  information  concern¬ 
ing  ro  and  cr,  however,  we  could  be  led  by  only  looking  at  the  signal  itself.  We 
need  to  seek  other  characteristics  of  on-off  intermittency.  First,  we  can  examine 
the  power  spectrum  of  the  time  series  generated  by  the  Ricker  Map.  In  figure 
2,  we  used  three  time  series  including  one  on-off  intermittency  case.  We  can 
compare  the  power  spectrum  of  these  three  cases.  Figure  4  shows  the  power 
spectrum  of  the  three  cases  shown  in  figure  2.  Case  (a)  shows  the  power  spec¬ 
trum  of  the  non-intermittency  case  which  looks  like  a  noisy  chaotic  time  series. 
For  lower  frequencies,  the  slope  is  almost  flat,  which  is  analogous  to  white  noise. 
We  can  see  slope  at  higher  frequencies.  Case  (b)  has  intermittency  signals  but 
not  on-off  intermittency.  Instead  of  a  slope,  the  power  is  intensified  in  higher 
frequencies.  Case(c)  shows  on-off  intermittency.  Unlike  the  previous  cases,  a 
clear  negative  slope  is  shown  from  low  frequencies  to  high  frequencies.  The 
slope  shown  in  (c)  can  be  considered  as  one  of  main  features  contained  of  on-off 
intermittency. 

Another  characteristic  we  have  to  consider  is  the  duration  of  off-stages  in 
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Figure  4:  Power  spectrum  of  three  different  cases,  (a)  r0  =  2.1,cr  =  0.2;  (b) 
To  =  7. 0,(7  =  0.0;  (c)  r0  =  2.1, cr  =  0.4 


the  time  series.  The  duration  of  an  off  stage  is  the  time  step  between  two 
consecutive  on  stages.  The  off  stage  can  be  defined  in  terms  of  the  signal  being 
below  a  certain  value.  Here,  we  will  use  0.001  for  the  criterion  for  the  off  stage. 
Using  the  logistic  map  and  a  random  process  uniformly  distributed  between 
0  and  1  for  driving  parameter,  Heagy  et  al.  (1994)  proved  that  the  probability 

_  3 

density  function  (PDF)  of  T ^  (the  duration  of  off-stages)  is  proportional  to  2 . 
They  also  showed  numerically  that  the  —3/2  law  is  maintained  regardless  of  the 
details  of  the  driving  method.  These  could  be  another  random  process  or  any 
chaotic  map.  We  can  expect  the  same  characteristics  of  on-off  intermittency  to 
occur  in  the  Ricker  Map.  Figure  5  shows  the  probability  density  function  of 
the  duration  of  off-stages  for  three  cases.  The  red  fine  is  the  PDF  for  on-off 
intermittency,  which  is  very  close  to  a  -3/2  slope  line  in  log- log  scale  plot.  The 
green  line  represents  the  PDF  for  the  noisy  chaotic  case  when  ro  =  2.1  and 
a  =  0.2.  Here,  the  slope  of  the  probability  density  function  is  quite  different 
from  -3/2.  The  last  line  is  for  another  type  of  intermittency  when  ro  =  7.0  and 
cr  =  0.0,  which  is  also  distinct  from  on-off  intermittency. 

The  —3/2  law  can  be  used  as  a  characteristic  to  determine  whether  a  time 
series  exhibits  on-off  intermittency.  But  it  must  be  understood  that  this  is  not 
proven  mathematically  in  general.  To  determine  whether  a  given  time  series 
shows  on-off  intermittency,  we  have  to  use  multiple  analyses  including  power 
spectra  and  the  PDF  of  the  duration  of  off  stages.  Finally,  if  possible,  we  will 
try  to  construct  a  dynamical  system  for  the  time  series  and  find  the  parameters 
necessary  for  the  generation  of  on-off  intermittency. 

A  single  Ricker  Map  with  Gaussian  white  noise  generates  on-off  intermit¬ 
tency  near  the  fixed  point  Nt  =  1.0.  The  power  spectrum  and  probability 
density  function  of  the  duration  of  off-stages  of  the  Ricker  Map  are  consistent 
with  the  results  at  several  previous  studies.  The  consistencies  shown  in  on-off 
intermittency  of  the  Ricker  Map  make  it  possible  for  us  to  use  the  Ricker  Map 
for  further  analysis  of  on-off  intermittency.  As  noted  in  the  introduction,  one 
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Figure  5:  Probability  density  function  of  the  duration  of  off-stages  in  log-log 
scales.  On-off  intermittency  case  is  compared  with  two  other  cases.  One  of 
them  is  noisy  chaotic  time  weries  with  ro  =  2.1  and  a  =  0.2.  The  other  is  other 
intermittency  case  when  r0  =  7.0  and  a  =  0.0. 

particular  system  is  not  totally  isolated  from  other  similar  systems.  More  specif¬ 
ically,  a  group  of  species  whose  population  is  governed  by  the  Ricker  Map  can 
interact  with  other  nearby  groups  of  same  species.  The  interaction  can  occur  in 
various  forms.  One  example  is  the  migration  from  one  group  to  nearby  groups. 
Another  example  is  sharing  of  environmental  resources.  Therefore,  our  next 
step  is  to  examine  how  on-off  intermittency  changes  through  in  a  single  map 
under  the  possible  interactions  with  adjacent  maps. 

3  Locally  Connected  Ricker  Maps 

Now  we  examine  the  dynamics  of  connected  maps.  For  example,  one  group  of 
salmon  can  interact  with  other  groups  via  migration  and  sharing  food  or  re¬ 
sources.  Depending  on  the  system,  the  form  of  the  interaction  may  be  different. 
As  a  starting  point,  the  interaction  can  be  assumed  to  be  linear  and  local,  which 
implies  that  maps  are  diffusively  coupled.  In  a  Ricker  Map,  local  coupling  can 
be  understood  as  the  dynamics  of  a  metapopulation  composed  of  many  different 
populations  that  share  individuals  between  different  sites. 

Lets  consider  the  case  of  K  locally  connected  maps  all  of  which  have  the 
same  parameter  values, 


where  k  =  1,2, ...,  AT,  and 


(12) 


267 


where  =  ro+crW^,  and  Wf  is  Gaussian  white  noise  whose  mean  and  standard 
deviation  are  0  and  1,  respectively.  Also  Wf  is  ^-correlated  in  space  and  time 
so  that  <  Wf  W™  >  =  SkmSts ■  The  coupling  coefficient  e  determines  the 
intensity  of  the  local  coupling.  We  also  assume  periodic  boundary  conditions 
which  means  that  the  Kth  map  is  coupled  to  the  ( K  —  l)th  map  and  1st  map. 
The  periodic  boundary  conditions  mimic  an  infinite  array  of  population  sites. 
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Figure  6:  Signal  pattern  in  a  locally  coupled  map.  r o  =  2;  1,  <r  =  0.4,  and 
e  =  0.01  are  used  for  the  map. 


Figure  6  shows  an  example  of  the  local  coupling  of  many  maps,  tq  =  2.1, 
a  =  0.4,  and  e  =  0.01.  “On  stages  ”,  seen  as  bright  color  dots,  appear  from  time 
to  time  after  long  period  of  “off  stages  ”.  The  on-stages  in  one  specific  map 
seem  to  be  related  to  that  of  the  adjacent  maps.  Sometimes,  the  on-stage  in 
one  map  seems  to  trigger  in  adjacent  maps.  We  see  a  cluster  of  on-stages  in  the 
same  space  and  time  domain.  This  might  be  due  to  transfer  between  adjacent 
maps.  Signal  transfer  can  shape  the  on-off  intermittency  in  the  case  of  a  local 
coupling. 

A  simple  question  is  whether  or  not  the  local  coupling  is  helpful  in  generating 
more  on-stages.  Figure  7  shows  three  examples  of  time  series  of  one  specific  map 
among  locally  coupled  maps.  The  coupling  coefficient  e  is  different  for  the  three 
examples.  The  first  case  has  no  coupling,  the  second  has  e  =  0.01  and  the  third 
one  has  e  =  0.1.  According  to  the  figure,  the  number  of  on  stages  generated 
during  a  fixed  time  increases  as  e  increases.  Even  though  e  is  quite  small,  the 
effect  from  this  weak  coupling  is  substantial.  Figure  7  demonstrates  that  even 
a  weak  coupling  can  change  the  statistics  of  on-off  stages  dramatically. 

In  a  single  map,  the  most  important  issue  is  where  on-off  intermittency  exists 
in  the  parameter  space;  it  exists  in  the  unstable  region,  but  very  close  to  the 
stability  curve.  Finding  where  on-off  intermittency  occurs  in  a  parameter  space 
must  begin  with  the  determination  of  the  stability  curve.  For  simplicity,  we  start 
with  a  3-map  local  coupling.  Even  though  only  three  maps  are  locally  connected, 
we  expect  to  find  general  characteristics  of  infinite  coupled  maps  due  to  locality 
and  periodic  boundary  conditions.  Generally,  finding  the  stability  condition  for 
a  point  in  a  parameter  space  is  equivalent  to  finding  the  Lyapunov  exponent. 
If  the  exponent  is  positive,  it  means  the  system  is  unstable.  The  stability  curve 
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Figure  7:  Time  series  of  one  specific  map  in  locally  coupled  maps.  Here,  ro  =  2.1 
and  er  =  0.4.  The  only  difference  among  three  examples  is  the  magnitude  of  e, 
the  coupling  coefficient. 


is  given  by  the  set  of  the  points  where  Lyapunov  exponent  is  zero.  To  find 
the  Lyapunov  exponent,  we  have  to  linearize  equation  (8)  near  the  fixed  point 
Nt  =  1.0.  If  I\  =  3,  our  linearized  equation  is 


Zt+ 1  —  YtZt, 


(13) 


where  Z*  =  N*  —  1.0  and 


Yt  = 


(1  ^  e)(!  -  r0  -  cr£i)  f(l -ro -CT&)  §(1  -  n>  -  ct£3) 

§(1  —  r0  —  cr£i)  (1  -  <0(1  -  r0  -  a&)  f  (1  -  r0  -  a£3) 


f(l-ro-crfi) 
Therefore,  Zt+ 1  is 


|(1  -r0-<T&)  (1  -  e)(l  -  r0  -  cr^3) 


Zt+i = (nnj  z°- 


Here,  the  Lyapunov  exponent  A  can  be  found  using 

1 


A  =  lim  -  In  1 1  IT  Ys  \  \ . 

t— >oo  t 


(14) 


(15) 


s=0 


It  is  known  that  a  limit  for  the  calculation  of  A  exists  even  though  the 
matrix  is  randomly  determined  at  each  time  step.  To  determine  the  change  of 
the  Lyapunov  exponent  with  local  coupling,  we  fix  rg  =  2.1  and  then  vary  a 
and  e  .  Figure  8  shows  the  Lyapunov  exponent  in  the  parameter  space  (er,  e) 
with  fixed  r g  =  2.1.  For  a  single  map,  with  a  larger  than  0.42,  the  Ricker  Map 
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Figure  8:  Lyapunov  exponent  in  the  parameter  space  (cr,  e)  with  fixed  ro  =  2.1. 


becomes  stable.  In  figure  8  we  find  a  =  0.42  when  e  =  0.0.  Interestingly,  a 
sudden  change  of  stability  with  local  coupling  is  formed  for  0.42  <  a  <  0.53 
where  the  stability  condition  suddenly  changes  with  coupling.  Even  a  weak 
coupling  makes  the  system  unstable.  If  a  is  large  enough,  local  coupling  cannot 
change  the  stability.  Lets  consider  a  scenario  relevant  for  this  analysis.  We 
think  of  the  population  of  salmon  governed  by  the  Ricker  Map.  The  growth 
rate  r  of  one  generation  of  salmon  can  be  assumed  to  fluctuate  due  to  a  change 
in  environmental  resources  at  every  time  step.  If  the  standard  deviation  of  the 
fluctuation  exceeds  a  certain  threshold,  for  example  a  =  0.42,  the  population  of 
the  salmon  always  maintains  the  fixed  value  even  after  many  generations.  Under 
the  same  environmental  conditions,  which  implies  that  the  mean  and  standard 
deviation  of  r  are  same,  we  can  think  of  a  case  where  salmon  in  a  group  start 
to  migrate  to  nearby  population  groups.  In  this  case,  we  expect  a  sudden  sharp 
increase  of  the  salmon  population  at  some  time  step. 

Figure  8  shows  the  increase  of  Lyapunov  exponent  with  e  within  some  range 
of  a.  If  we  remember  that  on-off  intermittency  is  seen  near  the  stability  curve 
in  the  parameter  space,  we  surmise  that  on-off  intermittency  formed  in  a  single 
map  could  disappear  with  strong  local  coupling.  First,  we  can  check  the  power 
spectrum  of  the  time  series  at  each  coupling  coefficient.  We  know  that  a  clear 
negative  slope  is  found  for  on-off  intermittency  in  the  single  map  analysis.  Figure 
9  shows  the  power  spectra  for  several  values  of  e.  These  power  spectra  are 
made  by  averaging  the  signals  in  213  maps  after  214  time  steps.  For  the  weak 
couplings,  when  e  is  0.1  or  0.2,  the  clear  negative  slope  is  shown  in  power 
spectrum.  However,  when  e  =  0.5  or  e  =  0.8,  flatness  over  low  frequencies 
appear,  which  also  appeared  in  the  noisy  chaotic  signal  in  the  single  map  when 
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r0  =  2.1  and  a  =  0.2. 


Figure  9:  Power  Spectra  for  several  local  couplings. 

We  have  seen  the  change  of  stability  and  the  statistics  of  on  and  off  stages 
due  to  local  coupling.  This  change  could  be  made  more  evident  if  we  trace 
how  a  certain  on-stage  is  generated  and  how  it  interacts  with  other  maps.  For 
this  job,  we  create  213  maps  locally  coupled  and  run  the  system  for  214  time 
steps.  A  substantial  number  of  on-stages  can  be  sampled  in  the  time  and  space 
domain.  In  this  sampling,  we  also  collect  adjacent  points  centered  on  the  on¬ 
stage  in  both  the  spatial  and  time  domains.  Figure  10  shows  the  average  map 
over  samples  collected  as  described  above.  For  this  figure,  we  chose  20  adjacent 
maps  and  40  time  steps  for  sampling.  The  time  step  is  from  lag  -20  to  lag  20 
centered  on  the  on-stage.  From  this  figure,  we  see  how  the  signal  in  locally 
coupled  maps  evolves.  Before  the  on-signal  is  generated,  we  see  an  increase  of 
the  signal  of  the  map  together  with  the  adjacent  maps.  The  on  signal  is  not 
generated  alone  randomly.  It  can  be  correlated  with  adjacent  maps  so  that  the 
signals  all  increases  together  for  the  on-signals.  This  argument  is  refined  further 
in  the  next  section.  After  an  on-signal  is  generated,  the  signal  decays  and,  at 
the  same  time,  diffuses  away  to  adjacent  maps.  It  is  seen  as  a  sharp  V-shape, 
which  may  be  related  to  the  V-shape  propagation  of  on-signals  in  the  spatial 
and  time  domains  from  time  to  time. 

The  characteristics  of  on-off  intermittency  shown  in  this  local  coupling  seems 
to  be  determined  by  how  the  signal  is  transfered  and  evolves  with  adjacent  maps. 
We  can  conclude  from  numerical  analysis  that  more  on-signals  are  generated 
through  the  coupling.  In  a  certain  range  of  a,  the  coupling  even  changes  the 
stability  of  the  map.  This  motivates  a  theoretical  approach  to  explain  how  the 
signal  of  a  specific  map  evolves  with  that  of  adjacent  maps. 

4  Signal  Transferring  in  locally  connected  maps 

Compared  with  a  single  Ricker  Map,  without  rigorous  quantification,  we  sur¬ 
mise  that  signals  are  coming  to  each  map  from  adjacent  maps,  which  may  lead 
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Figure  10:  Average  map  over  the  area  centered  on  on-stages  in  the  spatial  and 
time  domains. 


to  more  on  stages  in  time  series  of  locally  coupled  maps.  However,  without 
aid  from  numerical  results,  we  can  think  of  two  different  effects  due  to  local 
coupling.  As  suggested  above,  more  signals  will  come  into  each  map  from  adja¬ 
cent  maps.  Alternatively,  the  growing  signal  in  each  map  could  be  diffused  and 
transfered  to  adjacent  maps,  which  could  lengthen  the  duration  of  the  off  stage. 
Finding  which  one  is  more  dominant  or  how  to  determine  which  case  is  preferred 
requires  more  precise  analysis.  However,  these  two  possible  effects  only  treat 
signal  transferring.  As  we  know,  each  map  has  its  own  system  for  increasing  or 
decreasing  signals,  which  might  be  a  broader  sort  of  reaction.  Therefore,  the 
combination  of  diffusion  and  reaction,  which  are  randomly  controlled,  influences 
the  signal  pattern  in  the  locally  coupled  maps. 

On-off  intermittency  is  characterized  by  long  periods  of  calm  stages  and 
short  periods  of  bursting.  During  a  long  period  time  of  the  off-stage,  the  signal, 
which  is  the  distance  from  the  fixed  point,  is  negligibly  small.  Even  though 
we  can  easily  ignore  the  tiny  signals  during  off-stages,  the  occurrence  of  the 
on-signal  is  completely  governed  by  how  the  signal  evolves  during  the  off-stage, 
which  implies  that  the  generation  of  the  on-stage  is  completely  explained  by 
the  linearized  equation  near  the  fixed  point  or  more  generally  the  invariant 
manifold.  The  role  of  nonlinearity  is  to  prevent  further  increase  of  the  signal 
after  the  emergence  of  the  on-signal  and  to  make  the  signal  return  to  the  off 
stage.  Heagy  et  al.  (1994)  proved  the  —3/2  law  for  the  PDF  of  the  duration  of  off 
stages  using  a  linearized  equation  near  the  fixed  point,  which  was  possible  due 
to  the  fact  that  the  generation  of  the  on-signal  is  governed  by  the  signal  growth 
during  the  off  stage.  Metta  et  al.  (2010)  explained  the  signal  synchronization 
shown  in  two-map  coupling  based  on  the  signal  matching  during  the  off  stage. 
So,  how  the  signal  grows  during  the  off  period  is  the  key  process  for  explaining 
the  statistics  of  the  on-stages.  Returning  to  the  locally  coupled  maps,  the  main 
question  is  how  signals  grow  and  exchange  information  during  off  stages  in 
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locally  coupled  maps.  From  this  perspective,  linearized  equations  near  the  fixed 
point  during  a  particular  off  stage  will  be  considered. 

Despite  the  use  of  linearized  equations,  discrete  maps  are  difficult  to  analyze 
analytically.  A  continuous  model  is  more  useful  and  can  be  constructed  using 
stochastic  calculus.  Therefore,  we  use  a  time-continuous  model  qualitatively 
similar  to  the  Ricker  map  to  try  to  generate  on-off  intermittency.  The  model  is 
a  canonical  cubic  ODE, 


—  =  lix-  x3,  (16) 

whose  stability  near  x  =  0  is  determined  by  the  sign  of  p:  if  p  is  positive 
(negative),  equation  (16)  is  unstable  (stable).  The  similarity  to  the  Ricker  map 
is  seen  as 


dx 

dt 


d  d  1 

~nv = 


(17) 


where  the  shape  of  V  can  tell  us  the  stability  of  x  =  0  and  where  stable  solutions 
exist.  Figure  11  shows  the  shape  of  V  depending  on  the  sign  of  p  near  x  = 
0.  When  p  changes  from  negative  to  positive,  the  stable  solution  near  x  = 
0  becomes  unstable  and  two  stable  solutions  near  x  =  ±^/p  appear.  This 
situation  is  analogous  to  a  period-doubling  bifurcation  when  r  becomes  slightly 
larger  than  2  in  the  Ricker  map.  Therefore,  if  p  is  perturbed  near  0,  on-off 
intermittency  may  be  generated.  Due  to  this  qualitative  similarity,  we  can  use 
this  continuous  model  to  investigate  on-off  intermittency  in  locally  connected 
maps. 


g  <  0 


g  >  0 


Figure  11:  Shapes  of  V  depending  on  the  sign  of  p 


First,  it  is  necessary  to  analyze  on-off  intermittency  of  the  single  map  in  this 
continuous  model.  Near  x  =  0,  the  linearized  equation  is 


dx 

dt 


=  vx, 


(18) 
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where  /r  is  perturbed  by  some  external  process.  Here,  we  simply  choose  Gaussian 
white  noise  to  perturb  pas/x  =  /io  +  cr£,  where  £  and  o  represent  Gaussian  white 
noise  and  intensity  of  the  noise  respectively  Moreover  Gaussian  white  noise  £ 
can  be  written  as  time  derivative  of  a  Wiener  process  dW/dt.  Equation  (18) 
becomes  and 


dx  =  fioxdt  +  axdW. 


(19) 


Among  other  systems,  this  equation  describes  the  evolution  of  option  prices, 
where  it  is  called  the  Black-Scholes  model.  Ito-calculus  provides  us  with  an 
exact  solution,  which  is 


x  =  exp 


1 

2 


o2)t  +  oW 


(20) 


We  can  determine  the  stability  condition  from  the  solution  as 

<  (Mo  -  2)t  +  crW  >=  (no  -  \o2)t  >  0. 

Therefore,  the  stability  curve  in  the  (/io,er)  domain  is  p  =  \o2.  We  expect 
on-off  intermittency.  Figure  12  shows  a  example  of  on-off  intermittency  near 


g0=0.33,  o=0.8 


Figure  12:  On-off  intermittency  when  fig  =  0.33  and  o  =  0.8 


stability  curve  in  this  continuous  model. 

We  now  construct  the  locally  coupled  maps.  We  focus  on  signal  evolution 
and  transfer  during  a  certain  off-stage  so  that  the  linearized  equation  near  x  =  0 
will  be  considered  in  local  coupling.  Our  equation  is 


dxk 

dt 


fJ-oXk  +  &£kXK  +  e^o 


Xk+l  +  Xk- 1 

- o - xk 


+  to 


f  lAfc+1  ^,k—  lXk  —  1 

V  2 


(21) 
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where  k  and  e  are  the  index  of  each  map  and  the  coupling  coefficient,  respec¬ 
tively.  We  can  let  Xk  =  yk^ot  to  remove  the  first  term  in  the  equation  (21), 
thereby  giving 


dyk  fyk+l+Vk-l  \  .  f  .  ( £,k+lVk+l  +  ik-lVk-l 

~dt  =  e/X0  V - 2 - yk)  ^kXk  +  ea  l - 2 - 


which  can  be  written  in  matrix  form  as 

dY 


where 


and 


dt 


—  6/i0AY  +  a(I  +  eA'jY^, 


A  = 


/-I 
1 
2 
0 


u 


h  o  o 
-i  5  o 

h  -i  h 


h\ 

o 

0 

J 


Y  =  (yi,y2, . ,yn)T, 

Y£  =  (yi€l.  2/2^2, . ,ynin)T- 


(22) 

(23) 


Here,  A  describes  the  structure  of  the  periodic  connection  and  is  a  symmetric 
matrix.  We  can  easily  construct  the  integral  form  of  Y  (t) ,  which  is  written  as 

Y(t)  =  exp(ey0At)Y (0)  +  a(I  +  eA)  f  exp(ey0A(t  —  t'))YdW,  (24) 

Jo 

where 


YdW  =  (y1W1,y2W2, . ,ynWn), 

and  again  W  represents  Wiener  process.  Because  A  is  symmetric,  it  can  be 
diagonalized  and  represented  as  A  =  QDQT  where  D  is  the  diagonal  matrix 
whose  diagonal  terms  are  the  eigenvalues  of  A.  The  above  integral  equation 
becomes 

Y(t)  =  Qexp(eyoDt)QTY(0)  +  aQ(I  +  eD)  f  exp [enoD(t  —  t')]YdW  (25) 

Jo 

This  integral  form  does  not  immediately  give  us  any  information.  Even  though 
we  cannot  expect  a  Gaussian  distribution  for  the  PDF  of  Y ( t ) ,  which  means  we 
have  to  know  higher  order  moments,  the  correlation  could  give  us  some  impor¬ 
tant  information  regarding  the  signal  transfer  between  adjacent  maps.  From  the 
above  integral  equation,  we  can  construct  the  covariance  matrix.  Multiplying 
the  transpose  of  Y  ( t )  and  taking  ensemble  average,  the  covariance  matrix  can 
be  represented  as 

<  Y(t)Y(t)T  >  =  Qexp(ey0Dt)QT  <  F(0)FT(0)  >  Qexp(ey0Dt)QT 

+  a2Q(I  +  eD)  f  f  exp(e/Uo-D(f  —  t')QT  <  (YdW)(YdW)T  >  Qexp[ey,0D(t  —  t')](I  +  eD)QT . 
Jo  Jo 

(26) 
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To  simplify  equation  (26),  we  need  information  about  the  initial  covariance 
structure.  We  seek  the  average  behavior  of  signal  transfer  and  hence  it  is  rea¬ 
sonable  to  initially  assume  that  each  map  is  uncorrelated  with  any  other  map 
and  the  second  moment  of  each  map  is  identical.  Mathematically,  the  assump¬ 
tion  is 

<  y\ 0)  >=<  yl{ 0)  >=<  j/KO)  >= . =<  2/^(0)  > 

<  2/;(0)2/i(0)  >=  0 ,i^j 

and  dW  represents  a  Wiener  process  which  is  independent  at  each  time  step 
and  the  noise  input  at  each  map  is  independent  from  other  maps.  Based  on 
these  facts  and  dW  ~  N(0,\Jdt),  we  can  deduce  <  dWidWj  >=  Sijdt.  A  more 
rigorous  explanation  is  given  by  Ito-calculus  as 

f  [  f  <  (YdW)(YdW)T  >  g  =  [  fgS, 

Jo  Jo  Jo 

where  /  and  g  are  integrable  functions  and 
<yl>  dt 

<  y 2  >  dt 

<2 H>  dt 

There  is  no  distinction  between  any  of  n  maps  because  each  map  starts  from 
the  same  condition.  The  statistical  structure  of  all  maps  must  be  the  same  in 
time,  which  leads  to 

<  y\  >=<  y\  >  = .  =  <  y‘n  >  ■ 

Therefore,  the  integral  equation  for  covariance  can  be  simplified  as 

<  Y(t)YT(t)  >  =  Qexp(2ep0Dt)QT 

+  a2Q(I  +  eD)  f  exp(2ey0Dt)(I  +  eD)QT  <  y2  >  dt' .  (27) 
Jo 

A  technical  difficulty  exists  for  handling  the  nxn  matrix  in  this  equation.  The 
simplest  case  is  of  3  identical  maps  that  are  connected  locally.  Even  in  the 
3-map  case,  we  expect  that  the  most  important  aspect  for  signal  transference 
remains.  For  3-map  case  we  have 
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If  we  insert  Q  and  D  into  the  integral  equation,  we  find  the  integral  equation 
for  <  y2  >  as 


<  y  >  = 


12 

-  +  -exp(-3ep0t) 


<  F2(0)  > 


21  + 


<  y2  >  dt'. 


(28) 


Differentiating  with  respect  to  t.  we  get 


jt  <  y2  >=  -2ep0exp(-3ep0t)  <  y2{ 0)  >  +cr2(^  +  |(1  -  ^e)2)  <  y2  > 


-  2epo<j2(l  -  j|e)2  J  exp(-3ep0(t  -  t'))  <  y2(t')  >  dt' , 


(29) 


and  multiplying  both  sides  by  exp(3e/Zo t)  and  letting  q  =<  y2  >  exp(3epo t),  we 
find 


^  -  3ey0Q  =  -2e^0  <  y2( 0)  >  +a2 

at 


o  +  o(!-o^ 


q 


-2ep,0a2(l-  '^e)2  J  qdt' . 


(30) 


Differentiating  again  with  respect  to  t  on  both  sides,  we  get 

- (3ew + r2 + r2(1  -  s)2)l + 2e<j2™(1  -  y)2? = °-  (31) 

The  initial  conditions  are  determined  in  the  previous  steps  as 

9(0)  =<  y2{ 0)  >,  q'{ 0)  =  {ey0  +  a2  +  ^cr2(l  -  ^e)2)  <  y2{ 0)  >  . 

A  solution  can  be  easily  constructed  for  the  second  order  differential  equation 
with  constant  coefficients. 


where 


<  y 2  >=  C1e(A i-3^o)t  +  Cr2e(A2-3£Mo)i)  (32) 


^1,2  —  +  g0’2  +  3cr2(1  -  9e)2) 

±  3£Mo  +  ^  +  ^0-  -  |e)2)2  -  2ecrVo(l  -  ^e)2  (33) 


and 


c,  =  ((w>  +  y2  +  5<e<i  -  ?e)2)  -  A2), 

°2  =  -  (<w>  +  («2  +  |»2(i  -  ?<02)).  (34) 

The  coefficients,  C\  and  C2,  are  calculated  from  the  two  initial  conditions.  For 
t  0,  <  y2  >  is  governed  by  Cie^1-3^0)*  and  hence  we  can  write 


<  y2  >  ~  C'ie(Al_3£M°)t, 


(35) 
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and  we  can  compare  this  result  with  the  single  map.  For  the  single  map,  the 
stochastic  solution  is  x{Q)e^°~^a  .  Because  W  follows  N(0,t),  it  is  pos¬ 

sible  to  find  the  <  x 2  >=  x2(0)e^fJ-o+a  '>t.  Figure  13  shows  the  comparison  of 
<  y2  >  between  the  single  map  and  the  locally  connected  map.  The  x-axis  rep¬ 
resents  the  interaction  coefficient  e  and  the  y-axis  is  the  difference  of  exponential 
exponent  between  the  two  cases.  In  this  figure,  Ai  —  3e/uo  —  <J2  is  negative  for 
all  interaction  coefficients.  The  decrease  of  growth  rate  compared  with  that  of 
the  single  map  is  caused  by  the  local  connection  which  suppresses  the  growth 
of  the  second  moment  of  each  map  in  the  locally  connected  maps. 


Figure  13:  comparison  of  <  y2  >  between  the  single  map  and  the  locally  con¬ 
nected  maps 


This  result  seems  not  to  support  the  generation  of  more  on  stages  in  locally 
connected  maps.  A  more  revealing  aspect  may  be  the  correlation  between  two 
adjacent  maps.  From  the  equation  (12),  we  can  find  an  equation  for  <  y\yi  >. 

<  ym  >  =  (\-  ^e"3^0*)  <  y2(0)  > 

+  ^  j\\  -  i(l  -  3e)2e-3,Ma(t-t'))  <y*>  (]t,  (36) 


In  this  equation  <  yiy 2  >  is  determined  by  <  y2  >  which  we  calculated  previ¬ 
ously.  Before  finding  the  value  of  the  integral,  we  discuss  its  structure.  Accord¬ 
ing  to  our  previous  calculation  <  y2  >  is  an  exponentially  increasing  function 
with  time.  The  correlation  <  J/1J/2  >  is  the  time  integral  of  <  y2  >.  Therefore, 
we  can  expect  that  <  j/ij/2  >  increases  exponentially  with  time.  If  we  only  care 
about  the  leading  order  form  of  the  solution,  we  can  approximate  <  yij/2  >  as 


<  2/12/2  >~ 


1 

Ai  —  3e/io 


(l-fe)2l 

^1 


e(Ai— 3  c/x0)i 


(37) 


Obviously,  the  coefficients  multiplying  the  exponential  are  positive.  So,  the 
correlation  between  two  adjacent  maps  increases  exponentially. 

Returning  to  our  linearized  coupled  equations,  we  can  apply  the  above  results 
to  interpret  each  term  in  the  equation  (22).  First,  our  underlying  assumption 
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is  that  the  system  is  unstable  such  that  the  signal,  yk,  increases  on  average. 
Temporarily,  yk  can  experience  a  sudden  or  steady  decrease  but  yk  eventually 
increases  if  we  consider  long  time  periods.  We  found  from  the  above  analysis 
that  the  yk  are  positively  correlated  with  the  yk+i  and  the  yk-i  ■  In  equation 
(22),  the  term  — e/io yk  decreases  the  growth  rate  of  <  y\  >  compared  with  that 
of  a  single  model.  The  term  eyo(yk+i  +  yk- 1  )/2  has  the  most  important  role  in 
this  locally  connected  system  due  to  the  fact  that  yk+i  and  yk- 1  are  positively 
correlated  with  yk-  If  yk  starts  to  move  away  from  the  fixed  point  (yk  =  0),  it 
provides  positively  correlated  signals  to  yk-  More  specifically,  if  yk  is  negative, 
this  term  provides  negative  values  for  yk  at  the  next  time  step.  If  yk  is  positive, 
the  opposite  sequence  occurs.  This  term  drives  yk  away  from  the  fixed  point. 
When  we  consider  other  terms  (1  -  e)a^kyk  +  ecr(£ k+iyk+i  +  £fc-i2/fc-i)/2,  we 
must  recall  that  yk,  yk+ 1  and  yk- i  are  positively  correlated,  and  £fc+i,  £fc  and 
£fc_i  are  independent  Gaussian  white  noise  terms.  The  sum  of  these  noisy  terms 
is  negligible  compared  with  the  other  terms.  As  a  result,  local  connection  sets 
up  the  condition  for  the  positive  correlation  between  adjacent  systems,  which 
provides  the  positively  correlated  signals  to  one  specific  map. 

Our  calculation  in  the  context  of  a  linearized  local  connection  gives  a  picture 
of  how  the  signals  among  adjacent  maps  evolve  and  transfer  during  a  long  time 
period  of  the  off  stage.  Under  unstable  conditions,  adjacent  maps  together  have 
the  tendency  to  escape  from  the  fixed  point.  This  process  is  rapidly  accelerated 
due  to  the  transfer  of  positively  correlated  signals  from  one  specific  map  to 
adjacent  maps.  This  mechanism  can  explain  the  generation  of  more  on  stages 
when  we  increase  the  coupling  coefficient  e. 

5  Conclusion 

The  Ricker  Map  which  is  represented  as  Nt+ 1  =  Ntert^1~Nt'>  has  a  fixed  point 
at  Nt  =  1.0.  The  fixed  point  is  stable  or  unstable  depending  on  the  value 
of  7*t  relative  to  2.0.  If  rt  is  driven  by  Gaussian  white  noise  near  rt  =  2.0, 
on-off  intermittency  is  generated.  The  on-off  intermittency  in  the  Ricker  Map 
has  characteristics  consistent  with  those  discussed  in  previous  work.  The  power 
spectrum  for  on-off  intermittency  shows  a  clear  slope  in  log-log  scale  plots.  Most 
importantly,  the  probability  density  function  (j)  for  the  duration  of  the  off  stages 
in  the  Ricker  Map  is  also  proportional  to  (/>-3/2.  This  consistency  leads  us  to  a 
more  realistic  consideration  of  the  Ricker  Map. 

The  Ricker  Map  is  used  for  the  evolution  of  the  population  of  one  species 
at  one  generation.  A  realistic  environment  in  this  species  is  not  isolated  from 
other  similar  systems.  Therefore,  on-off  intermittency  in  the  Ricker  Map  is 
investigated  under  a  locally  connected  situation.  That  mimics  the  possible  in¬ 
teraction  among  neighboring  species  under  the  same  environmental  conditions. 
Even  though  the  exact  form  of  interaction  is  not  shown,  as  a  first  step,  we  use 
a  linear  coupling  with  adjacent  maps  based  on  the  coupling  coefficient  e.  The 
magnitude  of  e  represents  the  intensity  of  the  interaction.  According  to  numer¬ 
ical  simulations,  the  statistics  of  on-off  intermittency  are  changed  relative  to 
a  single  map  that  has  the  same  parameters.  Moreover,  the  stability  condition 
changes  due  to  local  coupling.  Within  some  range  of  er,  the  locally  connected 
Ricker  maps  become  unstable  even  though  the  single  map  is  stable  under  the 
same  parameters.  The  mechanism  for  explaining  the  change  caused  by  the  lo- 
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cal  coupling  must  be  strongly  related  to  the  evolution  and  transfer  of  signals 
between  adjacent  maps. 

Signal  evolution  and  transfer  of  information  in  a  locally  coupled  system  was 
considered  theoretically  using  a  time-continuous  model  similar  to  the  Ricker 
Map.  This  continuous  model  enables  us  to  use  stochastic  calculus  to  make 
progress.  We  found  that  the  signal  in  a  specific  map  belonging  to  the  locally 
coupled  system  is  positively  correlated  with  the  signal  in  adjacent  maps.  The 
positively  correlated  signals  that  come  from  adjacent  maps  are  added  to  the  sig¬ 
nal  in  the  map  for  the  next  time  step.  Under  unstable  conditions,  this  feedback 
due  to  correlations  with  adjacent  maps  causes  the  signal  to  accelerate  away  from 
the  fixed  point,  which  explains  the  change  of  statistics  of  on-off  intermittency 
under  the  local  coupling. 
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Abstract 

This  project  investigates  the  behaviour  of  a  treadmilling  microorganism  in  a  two- 
dimensional  unbounded  domain  with  a  half-infinite  non-slip  wall.  The  governing  evo¬ 
lution  equations  for  such  a  treadmilling  organism  are  derived  in  an  analytical  form 
and  trajectories  of  a  microorganism  for  several  different  initial  positions  and  heading 
directions  are  numerically  calculated.  Then  we  define  and  numerically  calculate  the  es¬ 
caping  probability  Pe ,  the  probability  that  the  treadmilling  organism  can  escape  from 
its  image  on  the  wall.  This  Pe  does  not  take  0  or  1  value  for  any  physically  reasonable 
initial  point.  This  suggests  that  the  treadmilling  organism  feels  the  presence  of  its  image 
on  the  wall  even  when  it  is  placed  at  a  position  far  from  the  wall  at  the  initial  time, 
and  that,  if  the  initial  heading  direction  is  appropriately  chosen,  it  is  possible  for  the 
microorganism  to  escape  from  the  wall. 


1  Introduction 

The  locomotion  of  microorganisms  has  received  much  attraction  in  fluid  dynamics  and 
Biology  recently  [1,  2].  As  their  motion  possesses  very  small  length  scales  and  speeds,  the 
dynamics  is  governed  by  low  Reynolds  number  hydrodynamics,1  where  inertial  forces  are 
negligible  in  comparison  to  the  viscous  effects  of  the  fluid. 

Many  studies  have  been  undertaken  on  such  dynamics  in  unbounded  low  Reynolds  num¬ 
ber  flows  [3,  4,  5].  In  reality,  however,  most  organisms  are  in  the  vicinity  of  other  bodies 
or  boundaries,  where  their  hydrodynamic  interactions  with  these  have  a  significant  effect 
on  their  motion.  The  importance  of  the  existence  of  boundaries  has  also  been  suggested 
by  many  experimental  observations.  For  example,  some  experimental  observations  [6,  7,  8] 
found  a  qualitative  feature  of  microorganism  near  a  solid  wall  to  be  attracted  to  no-slip 
boundaries.  Berke  etal.  [9]  measured  the  steady-state  distribution  of  smooth-swimming 
Escherichia  Coli  (E.  Coli)  between  two  glass  plates  and  found  a  strong  increase  of  the  cell 
concentration  at  the  boundaries.  They  also  theoretically  demonstrated  that  hydrodynamic 
interactions  of  swimming  cells  with  solid  surfaces  lead  to  their  reorientation  in  the  direc¬ 
tion  parallel  to  the  surfaces,  as  well  as  their  attraction  to  the  closest  wall.  Lauga  etal.  [10] 
showed  that  circular  trajectories  are  natural  consequences  of  their  force- free  and  torque- free 
swimming  and  hydrodynamic  interactions  with  the  boundary.  This  leads  to  a  hydrodynamic 


1For  example,  the  Reynolds  number  associated  with  the  motion  of  Escherichia  coli  (E.  Coli)  is  0(10  4) 
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trapping  of  the  cells  close  to  the  surface.  Drescher  etal.  [11]  found  that  when  two  nearby 
Volvox  colonies  swim  close  to  a  solid  surface,  they  attract  one  another  and  can  form  stable 
bound  states  in  which  they  waltz  or  minuet  around  each  other.  These  observations  suggest 
that,  in  order  to  obtain  a  comprehensive  understanding  of  low  Reynolds  number  locomo¬ 
tions,  it  is  necessary  to  study  hydrodynamic  interactions  between  microorganism  and  the 
boundaries. 

Some  of  the  phenomena  stated  above  have  already  been  verified  by  numerical  simulations 
[12,  13].  However,  not  many  physical  explanations  have  been  given  to  the  locomotion  of 
microorganism  near  boundaries.  Berke  etal.  [9]  studied  the  swimming  microorganisms’ 
attraction  to  boundaries  by  modeling  the  swimmer  as  a  force  dipole  singularity.  This  simple 
model  captures  the  attraction  of  the  microorganisms  to  boundaries.  However,  contrary  to 
the  experimental  findings,  the  microorganism  in  this  model  crashes  into  the  boundary  in 
finite  time.  Moreover,  their  analysis  did  not  investigate  the  dynamics  of  microorganism 
near  the  boundary.  Or  and  Murray  [14]  studied  the  dynamics  of  low  Reynolds  number 
swimming  organism  near  a  plane  wall.  They  analysed  the  motion  of  a  swimmer  consisting 
of  two  rotating  spheres  connected  by  a  thin  rod,  as  a  simple  theoretical  model  of  swimming 
organisms.  They  found  that  when  the  spheres  are  rotated  in  unequal  velocities,  the  swimmer 
has  a  solution  of  steady  translation  parallel  to  the  wall,  and  that  under  small  perturbation, 
the  swimmer  exhibits  wave-like  motions  along  the  wall.  These  results  have  recently  been 
verified  experimentally  on  a  macroscale  robotic  prototype  swimming  in  a  highly  viscous  fluid 
[15].  Furthermore,  Crowdy  and  Or  [16]  have  proposed  a  singularity  model  for  swimming 
microorganisms  placed  near  an  infinite  no-slip  boundary.  Their  model  was  based  on  a 
circular  treadmilling  organism  which  has  no  means  of  self-propulsion.  They  proposed  the 
appropriate  Stokes  singularities  that  represented  the  flow  field  created  by  this  treadmilling 
organism  in  the  global  fluid.  By  studying  the  interaction  between  these  singularities  and  the 
no-slip  wall,  they  formulated  the  nonlinear  dynamics  of  the  treadmilling  organism  explicitly, 
and  fully  characterised  its  motion  near  the  wall.  The  trajectories  parallel  to  the  wall  and 
periodic  bouncing  motion  along  the  wall  found  in  Crowdy  and  Or  [16]  had  remarkable 
similarities  to  the  trajectories  shown  in  Or  and  Murray  [14].  Crowdy  and  Samson  [17], 
utilising  the  singularity  model,  investigated  the  dynamics  of  treadmilling  organism  near  an 
infinite  no-slip  boundary  with  a  gap  of  a  fixed  size.  They  employed  a  conformal  mapping 
technique  to  avoid  the  difficulty  in  treating  the  image  of  the  treadmilling  organism  on  the 
wall.  In  their  model,  the  treadmilling  organism  was  found  to  make  several  qualitatively 
different  trajectories;  jumping  over  the  gap,  rebounding  from  the  gap,  trapping  near  the 
gap,  and  escaping  the  gap  region  even  when  the  organism  has  the  initial  point  in  the 
gap.  They  also  performed  a  bifurcation  analysis  as  the  size  of  the  organism  varies,  and 
demonstrated  a  presence  of  stable  equilibrium  points  in  the  gap  region  as  well  as  Hopf 
bifurcations  to  periodic  bound  states.  This  reduced  model  system  also  exhibited  a  global 
gluing  bifurcation  in  which  two  symmetric  periodic  orbits  merge  at  a  saddle  point  into 
symmetric  bound  states  having  more  complex  spatio-temporal  structure. 

The  present  project  examines  the  dynamics  of  a  treadmilling  organism  near  a  half¬ 
infinite  no-slip  wall  (a  corner  whose  angle  is  7r).  This  is  an  important  preliminary  step  to 
clarifying  the  dynamics  of  a  treadmilling  organism  near  a  no-slip  corner  with  an  arbitrary 
angle,  which  will  shed  light  on  the  cases  with  more  complicated  realistic  geometries. 
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Two-dimensional  Stokes  flow  problem  and  Goursat  func¬ 
tions 


2.1  General  Solution  for  Stokes  equation 

In  many  of  existing  three-dimensional  theoretical  models  of  low  Reynolds  number  locomo¬ 
tions,  swimming  microorganisms  possess  symmetry  about  their  sagittal  plane  [14,  18]  or  even 
about  their  longitudinal  axis  [4,  5,  19].  This  suggests  that  even  in  a  fully  three-dimensional 
flow,  the  motions  of  microorganisms  are  confined  to  a  two-dimensional  plane.  Consequently, 
here  we  consider  the  two-dimensional  model  for  a  microorganism  in  a  (x,  z/)-plane,  in  which 
we  shall  be  free  to  treat  the  flow  in  a  complex  plane  (z-space),  where  z  =  x  +  iy.  While  the 
complex  function  method  is  one  of  the  classical  methods  applied  to  ideal  flows,  a  similar 
analytical  method  may  also  be  applied  to  two-dimensional  Stokes  flows  [20,  21].  Here,  the 
complex  method  for  Stokes  flows  is  introduced  to  show  that  the  two-dimensional  Stokes 
flow  is  expressed  through  a  couple  of  complex  functions.  This  helps  us  to  understand  the 
flow  clearly. 

Stokes  equations  which  describe  the  motion  of  an  incompressible  viscous  fluid  are 

Vp  =  r]  Au, 

V  •  u  =  0, 

where  A  is  a  Laplace  operator,  u  =  (ux,uy)  is  the  fluid  velocity,  p  and  q  are  the  pressure 
distribution  and  dynamic  viscosity,  respectively.  As  we  are  considering  a  two-dimensional 
flow,  we  can  introduce  the  stream  function  ip,  such  that  the  velocity  is  given  by 


ux  =  dip/dy,  uy  =  —dip/dx, 


(2) 


then  the  Stokes  equations  (1)  reduce  to  biharmonic  equation 


A2ip  =  0. 


Now  we  define  complex  velocity  W  as 


dip  dip 

ux  +  iuy  =  - z  — 

ay  ay 


=  -2i^. 

dz 


The  vorticity  uj  =  —'S7Iip  is  then  given  by 


„2  ,  dW 

lo  =  —\7~ip  =  2i— — . 

dz 

By  substituting  (2)  and  (4)  into  the  Stokes  equations  (1),  we  obtain 

dp  duj  dp  du 
dx  ^  dy'  dy  ^  dx ' 


(3) 


(4) 

(5) 


The  equations  in  (5)  are  the  Cauchy-Riemann  equations  on  the  pair  p  and  —  qu.  It  follows 
that  p  —  iqu  is  an  analytic  function  of  z,  and  so  that  it  can  be  written  as 


p  —  ir/oj  =  Aqf'(z), 


(6) 
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Figure  1:  Schematic  description  of  a  (a):  Stokes  dipole  (b):  stresslet  (b):  rotlet. 

where  f\z )  is  an  arbitrary  analytic  function  of  z.  Taking  the  imaginary  part  of  (6),  it  is 
obvious  that,  by  means  of  the  function  f(z),  oj  can  be  written  as 

u  =  —  41m  [f'{z)\  .  (7) 

Substituting  (7)  into  (4)  and  integrating  it  with  respect  to  z,  we  obtain 

W  =  ux  +  iuy  =  f(z)  +  zf'{z)  +g'(z),  (8) 

where  g'(z)  is  an  arbitrary  analytic  function  of  z.  Again,  substituting  (8)  into  (4)  and 
integrating  it  with  respect  to  z,  we  finally  obtain  the  general  solution  for  tjj  in  (3)  as 

Im  [zf(z)  +  g(z)\.  (9) 

The  fictions  f(z)  and  g(z),  which  are  called  Goursat  functions,  are  analytic  everywhere  in 
the  considered  domain,  except  when  isolated  singularities  are  introduced  in  order  to  model 
various  flows  of  interest.  The  functions  zf(z)  and  g(z)  in  (9)  correspond  to  the  viscous  and 
potential  part  of  the  flow,  respectively.  Note  that  g{z)  does  not  appear  (7),  since  potential 
flow  has  zero  vorticity. 

2.2  Singular  solutions  of  Stokes  flow 

The  Goursat  functions  f(z )  and  g(z)  which  determine  the  stream  function  of  the  flow  are 
analytic  almost  everywhere  in  a  considered  domain  except  at  isolated  singular  points.  It  is 
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necessary  to  choose  appropriate  singularities  for  the  Goursat  functions  in  order  to  describe 
the  flow  of  interest.  What  kind  of  singularities  should  we  use  in  order  to  model  the  flow 
generated  by  a  treadmilling  organism  near  a  no-slip  half-infinite  wall?  This  is  one  of  the 
main  problem  we  should  cope  with  for  the  formulation  of  the  governing  evolution  functions 
of  the  treadmilling  organism,  and  this  will  be  discussed  later  in  §3.1.  Here  in  this  section, 
the  basic  singularities  of  the  Stokes  flow  are  discussed.  The  flow  generated  by  a  treadmilling 
organism  may  be  modeled  with  some  of  these  singularities. 

First  of  all,  let  us  consider  f(z )  of  the  form 

f(z)  =  g\og(z- zd),  (10) 


where  /j  £  C  is  the  strength  of  the  singularity.  Then  we  insist  that  the  complex  velocity, 
given  by  (8), 


ux  +  iuy  =  -g  log (2;  -  zd)  +  +  g' (z) 

Z  Zd 

1  (  s,z(z~Zd)  ~pZd 

=  -glog (z  -  zd)  +  — — =—  +  = — =  +  g  {z) 
z-  zd  z-  zd 

should  be  both  single-valued  and,  at  least,  logarithmically  singular  at  zd. 
constraints,  g{z)  should  be  chosen  to  have  the  form  which  satisfies 

g'(z,t)  =  -~p\og(z  -  zd). 

Z  zd 

Hence,  if  the  Goursat  functions  f(z)  and  g(z)  are 

f(z)  =  /ilog (z  —  zd)  +  analytic  function, 

ilZd 

g'{z)  =  — — — —  —  g\og(z  —  zd)  +  analytic  function, 

z  zd 


(11) 

From  these 

(12) 


(13) 


the  flow  described  by  these  Goursat  functions  (13)  is  the  one  generated  by  Stokeslet  at  zd. 
The  complex  velocity  (11)  takes  the  form 

ux  +  iuy  ~  -g  log  |  z  -  zd  | 


and  grows  logarithmically  as  z  — >•  00.  This  is  the  heart  of  the  Stokes  Paradox.  In  order  to 
avoid  the  Stokes  paradox,  we  must  restrict  our  attention  to  Stokes  flow  problems  which  do 
not  contain  any  Stokeslet.  Note  that  imposing  certain  singularities  to  f(z)  forces  g  some 
concomitant  singularities.  Stokes  dipole  is  obtained  by  superposing  two  Stokeslet,  drawing 
infinitesimally  close  together,  and  with  equal  and  opposite  strengths  tending  to  infinity  at 
a  rate  inversely  proportional  to  their  separation.  This  is  schematically  shown  in  Fig.l  (a). 

Next,  if  we  allow  f(z)  to  have  a  simple  pole  at  zd, 

m  -  ^  in) 

where  g  G  C,  and  insist  that  the  complex  velocity,  given  by  (8),  should  be  singular  like 
1/1  z  —  zd |,  not  line  1/| z  —  zd |2.  Then,  again,  the  singularity  in  f(z)  forces  g(z)  to  have 
singularities,  and  we  find  that  g(z )  should  be  chosen  to  have  the  form  which  satisfies 


g'(z) 


gzd 

(z  -  zX)2 ' 


(15) 
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This  g(z)  corresponds  to  an  irrotational  dipole  contribution  of  strength  —gzd.  If 

f(z ) — — - K  analytic  function, 

z  -  zd 

g'(z)  =  ^  0  +  analytic  function, 

(z  -  zdy 


(16) 


we  say  that  there  is  a  stresslet  of  strength  g  at  zd.  This  is  given  by  the  addition  of  two 
Stokes  dipoles  oriented  at  90°  with  respect  to  each  other  as  schematically  shown  in  Fig.l 
(b). 

When  f(z)  has  singularities,  as  already  stated  above,  g(z)  should  have  concomitant 
singularities  to  those  in  f(z),  but  not  conversely;  g(z)  can  have  singularities  which  is  in¬ 
dependent  of  those  of  f(z).  These  are  kinematically  identical  to  the  singularities  of  the 
complex  potential  flow  in  two-dimensional  incompressible  and  irrotational  flow.  Let  us 
allow  g{z)  to  admit,  for  instance,  a  logarithmic  singularity,  which  means  simple  pole  in 
£/'(z), 

g{z)  =  c\og{z- zd),  (17) 

where  c  €  R  or  G  C.  If  c  €  R  zd  is  a  source/sink,  whilst  if  c  €  C  zd  is  a  rotlet(or  couplet). 
A  rotlet  is  given  by  the  difference  between  two  Stokes  dipoles  oriented  at  90°  with  respect 
to  each  other  as  schematically  shown  in  Fig.l  (c),  and  this  is  kinematically  identical  to  a 
point  vortex  [22],  Similarly,  a  double  pole  of  g'{z )  corresponds  to  an  irrotational  dipole 
singularity,  a  triple  pole  corresponds  to  an  irrotational  quadrupole  and  so  on. 

With  regard  to  modelling  a  swimming  micro-organism,  we  are  free  to  choose  any  com¬ 
bination  of  such  Stokes  flow  singularities  which  seems  to  be  appropriate  for  the  problem  at 
hand.  However,  each  of  these  singularities  will  be  a  function  of  the  swimmers  size,  shape, 
its  swimming  protocol  and  its  local  effect  on  the  fluid  around  it.  The  attention  is  therefore 
focused  on  what  specific  combination  of  singularities  can  be  used  to  accurately  model  a 
physical  microorganism. 


3  The  model  for  a  treadmilling  tiny  swimming  organism 

3.1  Basic  settings  and  boundary  conditions 

Here  we  describe  our  simple  model  for  a  treadmilling  organism.  Any  swimming  organism 
in  a  Stokes  flow  locally  generates  a  flow  that  can  be  modelled  by  some  combination  of 
Stokes  flow  singularities  described  in  §2.2  placed  on  its  position  or  its  boundary.  This 
singularity  distribution  is  generally  a  complicated  function  of  the  swimming  organism’s 
basic  properties  such  as  size,  shape,  and  configuration  of  cilia.  The  treadmilling  organism 
does  not  have  any  self-propulsion  in  itself,  but  moves  around  in  the  considered  domain 
when  there  is  a  boundary  near  it  [6,  7,  10,  9,  11,  12,  13,  9,  14,  15,  16,  17]2.  It  suggests 
that  if  a  treadmilling  organism  is  influenced  by  its  surroundings  such  as  other  swimming 
organisms  or  a  solid  boundary,  and  behaves  in  a  different  way  from  when  it  is  in  isolation. 

2  Strictly,  “treadmilling”  and  “swimming”  describe  different  behaviours  of  the  organism,  and  here  in  this 
project,  we  are  only  considering  treadmilling  organisms.  However,  as  a  treadmilling  organism  can  move  in 
the  flow  when  there  is  a  no-slip  wall  around  it,  we  occasionally  use  the  description  ’’swimming  organism” 
when  we  focus  upon  the  organism’s  moving  behaviour  in  the  flow. 
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Figure  2:  Simple  model  for  a  treadmilling  organism.  Zd,  e,  and  6  are  position  of  the  organism, 
radius  of  the  circular  body  of  the  organism,  and  the  angle  of  the  head  of  the  organism  from 
the  real  axis,  respectively. 


This  means  that,  strictly  speaking,  the  singularities  in  the  Goursat  functions  referred  above 
should  be  replaced  by  effective  singularities,  which  will  be  complicated  functions,  according 
to  the  treadmilling  organism’s  surroundings.  However,  as  a  first  approximation  model 
of  a  treadmilling  organism’s  hydrodynamic  interaction  effects  with  a  wall,  it  seems  to  be 
reasonable  to  consider  the  treadmilling  organism  as  passive.  It  follows  that  the  treadmilling 
organism  always  keeps  its  characteristics  irrelevant  to  its  surroundings. 

In  a  simple  theoretical  model,  we  assume  that  the  treadmilling  organism  has  a  circle 
body  of  radius  e,  with  a  moving  centre  Zd{t )  =  Xd(t)  +  iyd(t)  in  the  fluid.  We  also  assume 
that,  with  respect  to  the  angle  of  the  head  of  the  treadmilling  organism  from  the  real  axis, 
9(t)  ,  surface  actuators  of  the  treadmilling  organism  induce  a  tangential  velocity  profile 
given  by 


U{4>,t)  =  2V  sin  (2((j>  —  9)) 

=  c(t)  exp(2i(j))  +  c(t)  exp(— 2i(j>),  c(t)  =  —  zF  exp(— 2i  0(t)).  (18) 

Here  V  is  a  constant  whose  magnitude  sets  the  time  scale  for  the  treadmilling  action,  cj)  is 
the  angle  measured  from  the  positive  x  direction  and  <p  =  6  is  the  direction  of  the  head  of 
the  treadmilling  organism  (Fig. 2).  Note  that  when  the  treadmilling  organism  produces  a 
time-dependent  tangential  velocity  profile,  the  Stokes  equations  imply  that  the  fluid  reacts 
to  it  instantaneously.  Also  note  that  in  unbounded  fluid,  this  velocity  profile  does  not  result 
in  any  self-propulsion  of  the  organism  due  to  its  symmetry  about  real  and  imaginary  axes. 

The  two  Goursat  functions  f(z,t)  and  g(z,t),  which  give  the  stream  function  of  the 
flow,  should  satisfy  two  boundary  conditions:  boundary  conditions  on  the  no-slip  wall  and 
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on  the  surface  of  the  treadmilling  organism.  The  boundary  condition  on  the  no-slip  wall  is, 
from  (8), 

u  +  iv  =  —f(z,  t )  +  zf'(z,  t)  +  g'{z,  t)  =  0  on  the  wall.  (19) 

Next,  suppose  that  the  centre  of  the  treadmilling  organism  Zd  moves  in  a  translational 
velocity  xd(t)  +  iyd(t),  and  the  organism  rotates  with  an  angular  velocity  Then  the 

boundary  condition  of  the  velocity  field  around  the  treadmilling  organism  can  be  written 
as 


dz 


ux+iuy  =  -  f(z,t)+zf(z,t)+g'(z,t )  =  xd(t)+iyd(t)+[eQ  +  U(<j),6(t))]  — ,  at  {z\  \z-zd\  =  e}, 


where  dz/ds  is  the  complex  unit  tangent  to  the  boundary.  Here,  the  relations 


(20) 


z  zd  — 


z  —  zd  ds 


dz  _  (z  -  zd) 


have  been  used. 


3.2  Singularities  of  f(z,t),g(z,t)  and  governing  evolution  equations  of  a 
treadmilling  organism  near  a  half-infinite  wall 


As  it  was  mentioned  in  §2,  the  flow  generated  by  a  treadmilling  organism  in  a  Stokes 
flow  can  be  locally  modelled  by  some  combination  of  Stokes  flow  singularities.  However, 
it  is  not  obvious  what  singularities  should  be  chosen  for  each  considered  situation.  A  low 
Reynolds  number  treadmilling  organisms  exert  no  net  force  or  torque  upon  the  flow  around 
it.  Consequently,  here  we  introduce  a  mathematical  representation  of  the  treadmilling 
organism  as  a  two-dimensional  stresslet,  which  is  accompanied  by  additional  singularities 
to  those  for  the  stresslet  in  its  Goursat  functions.  Note  that  neither  a  Stokeslet  nor  a  rotlet 
should  be  used  for  modelling  the  treadmilling  organism,  since  the  treadmilling  organism 
should  be  force-free  and  torque-free.  The  type  of  the  singularities  in  Goursat  functions 
should  be  chosen  such  that  the  Goursat  functions  satisfy  two  boundary  conditions  (19) 
and  (20).  For  this,  we  employ  “image  system  method”  [23,  16].  It  is  done  by  introducing 
additional  singularities  at  z  =  Zd,  physically  understood  as  the  image  of  the  treadmilling 
organism,  to  the  Goursat  functions. 

The  determination  of  the  Goursat  functions  is  basically  performed  by  trial  and  error. 
Here  we  seek  solutions  for  these  Goursat  functions  having  the  form 

z,t)  =  — ' +  fo  +  fi(z  -  zd(t)) -t - , 

9  (Z,t^  {z-zd{t)f  +  {z-zd(t)Y  + 


f(z ,  t)  having  no  Stokeslet  and  g(z,  t)  having  no  rotlet  imply  that  the  treadmilling  organism 
is  force-free  and  torque-free.  Now  we  use  the  boundary  condition  (20)  to  find  relations 
between  the  coefficients  in  (21)  and  the  velocity  of  the  treadmilling  organism.  On  the 
surface  of  the  treadmilling  organism,  where  | z  —  zd\  =  e, 


z  zd  — 


e 


2 


Z-Zd 


dz 

ds 


.z  -  zd 

i - 


e 


(22) 
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hold.  Substituting  (18)  and  (22)  into  (20)  then  equating  coefficients  of  different  powers  of 
z  —  Zd,  we  obtain 


Xd  +  Wd  =  ~fo  +  fiZd  +  9o, 
itt  =  -  (/i  -  fi)  , 

H  =  —iec,  (23) 

a  =  gz^, 

b  =  ge-  ice 3  =  2jie2 . 


Thus  the  Goursat  functions  f(z,t )  and  g'(z,t )  have  the  form 


f(z,t ) 


5'(M) 


„  +  /o(0  +  /iW(z  -  ^d(*))  +  O  {(z  ~  zd{t))2 )  , 


2g{t)e2 

(z-zd(t))3) 


H(t)zd{t) 

(z-zd(t))2) 


+  ffo(*)  +  ^  {{z 


Zd(t ))) , 


(24) 


around  zd.  This  means  that  the  treadmilling  organism  has  an  equivalent  point  singularity 
description  with  a  stresslet  of  strength  //,  corresponds  to  the  terms  y/(z  —  zd)  in  f(z)  and 
(fiZd) / (z  —  zd)2  in  g'(z),  and  the  quadrupole  of  strength  2e2/u  at  zd,  corresponds  to  the  term 
(2 e2fi)/(z  —  zd)3  in  g'{z).  Note  that  the  existence  of  the  quadrupole  enables  this  model  to 
describe  a  treadmilling  organism  with  a  finite  size;  for  the  pure  point  treadmilling  organism, 
e  =  0,  the  quadrupole  vanishes.  We  also  set  the  time  scale  of  the  motion  by  letting  V  =  W1 
so  that 

y(t)  =  exp  (2 i0(t)) .  (25) 

Since  the  treadmilling  organism  is  considered  as  a  passive  scalar  here,  the  temporal 
change  of  its  position  follows  the  velocity  of  the  fluid  around  it.  Here,  however,  we  should 
be  aware  that  the  velocity  components  which  originates  from  the  position  of  the  treadmilling 
organism  cannot  affect  the  treadmilling  organism  itself  by  definition  of  “treadmilling” .  With 
these  consideration,  i  =  xd  +  iyd  is  given  by  the  analytic  part  of  the  flow  velocity  (8)  at 
z  =  zd,  and  can  be  exactly  written  as 


dzd 

dt 


-fo  +  zdfi  +  go- 


(26) 


Similarly,  the  temporal  change  of  the  heading  angle  of  the  treadmilling  organism  is  given 
from  the  half  of  the  analytic  part  of  the  vorticity  of  the  flow  (7)  at  z  =  zd- 


rW 

Tt  =  -21m  [/,].  (27) 

Hence,  from  (26)  and  (27),  if  we  know  fo,  fi,  and  go  of  f(z)  and  g(z)  that  satisfy  the 
conditions  (19)  and  (20),  we  obtain  the  governing  dynamical  system  of  the  treadmilling 
organism  in  an  explicit  form  and  know  its  trajectory. 
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Figure  3:  Considered  domain  in  2-space  and  (-space.  The  points  A  and  B  in  both  spaces 
are  mapped  to  each  other  by  the  conformal  mapping  (  =  iz1/2  and  its  inverse  mapping 
2  =  z{Q)  =  — (2,  then  the  infinite  wall  in  (-space  corresponds  to  the  half-infinite  wall  in 
2-space. 

3.3  Conformal  mapping 

The  Goursat  functions  in  (24)  are  the  solution  for  the  stream  function  for  the  flow  generated 
in  the  vicinity  of  the  treadmilling  organism  without  accounting  for  the  effect  of  the  wall. 
To  know  the  Goursat  functions  that  have  the  form  (24)  near  Zd  and  that  also  satisfy  a 
boundary  condition  ux  +  iuy  =  0  on  the  wall,  we  have  to  find  the  coefficients  /o,/i  and 
go  such  that  the  velocity  satisfies  the  boundary  condition  on  the  wall.  It,  however,  is  not 
straightforward  to  determine  such  coefficients  in  a  region  near  a  half-infinite  wall  of  an 
infinitesimal  width  in  2-space,  since  where  to  place  the  image  of  the  swimmer  is  not  clear  as 
we  are  now  considering  a  half-infinite  wall  of  infinitesimal  width.  To  treat  this  complicated 
situation,  here  we  employ  conformal  mapping  techniques.  Although  the  boundary  value 
problem  we  have  here  is  not  conformally  invariant,  these  techniques  still  work,  since  we 
always  come  and  go  to  the  two  spaces, 
let  us  consider  a  mapping 

(  =  iz1/2.  (28) 

This  maps  the  domain  {z  £  C}  to  {(  |  Im(()  >  0},  especially,  the  domain  {z  |  Iiri(z)  = 
0,  Re(2)  <  0}.  The  half-infinite  wall  we  are  considering  in  2-space  are  mapped  to  the  domain 
{(  |Im(()  =  0},  the  infinite  wall  in  (-space(Fig.3).  The  inverse  of  this  conformal  mapping 
(28)  can  be  explicitly  written  as 

2  =  -(C)  =  -(2-  (29) 

When  written  as  a  function  of  2,  the  function  ((2)  has  the  same  type  of  point  singularity 
with  the  one  required  for  f(z)  and  g{z).  It  follows  that  (  can  serve  as  a  uniformisation 
variable  for  the  problem,  and  that  if  we  define  the  composed  functions 

F(()  ee  /  (2(0), 

=  M  ()), 

F(()  and  G(()  can  be  written  as  single-valued  functions  of  the  variable  (  and  are  analytic 
except  at  the  position  of  the  isolated  singularities  we  impose  to  these  functions  to  describe 
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the  treadmilling  organism  and  its  image.  In  other  words,  by  utilising  this  uniformising  ( 
(([-space),  we  can  write  two  analytic  functions  which  determine  the  flow  as  single- valued 
functions  with  no  branch  point  singularities. 

From  Crowdy  and  Or  [16]  and  Crowdy  and  Samson  [17],  we  propose  the  function  F(Q 
to  have  the  form 

A  B  CD 

-  c  -  Cj  +  (c  -  Q)3  +  (C  -  Cd)2  +  (c  -  &) 

where  we  can  set  E  to  be  zero  without  loss  of  generality  because  of  an  additive  degree  of 
freedom  in  the  definition  of  f(z).  Cd  corresponds  to  the  image  of  the  swimmer  on  the  wall, 
and  in  the  ([-space,  this  is  simply  a  complex  conjugate  of  Cd-  Utilising  this  and  the  inverse 
mapping  (29),  we  define  the  image  of  the  swimmer  in  2-space  as 


Zd  =  z((d)  =  z(( d). 

Recalling  the  non-slip  boundary  condition  on  the  wall  in  2-space  (19),  G(C)  should  satisfy 

G(C)=W)-zf'(z) 

=  HO  -  HCF^C(-*C)_2^/(C) 

=  ^(0  ~cF^(-<cr2m 

where,  we  have  utilised  the  fact  that  C  =  C  on  the  wall  in  ([-space.  Now,  as  C  =  |CI  exP(*0)>  0  = 
0  or  7T  on  the  wall, 


(-*C )2(-*C)  2  =  [|CI  exp(i(-vr/2  +  c/>))]2  [|C|  exp(i(-7r/2  +  </>))]  2 
=  [exp(i(vr  -  2 (j)))]2  =  1 

holds,  so  that 

G'(C)=i?(C)-^Ci?'(0-  (32) 

By  substituting  (31)  into  (32),  we  obtain 


G(C) 


A  B  C  D 

C  -  Cd  +  (C  -  Cd?  +  (C  -  Cd)2  +  (C  -  Cd) 


-A  -3  B  -2  C  -D 

_(C-Cd)2  +  (C-Cd)4  +  {c-q)3  +  (c-o)2.  ‘ 


(33) 


As  Cd  is  outside  the  considered  domain  in  ([-space,  he.  in  the  wall,  G(C)  only  has 
singularity  at  ([  =  Cd-  Then  it  follows  that  g(z)  has  singularities  at  z  =  Zd  and  Moffatt,  type 
singularity [24]  at  2  =  0. 
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Now  we  have  to  determine  the  unknown  coefficients  i-DsC.  For  A,  we  use  the  fact 
that  f(z( £))  has  to  have  the  singularity  near  zd  as  shown  in  (24).  Performing  the  Laurent 
expansion  of  1/  (£  —  Q)  in  terms  of  2  around  zd  and  comparing  (24)  and  (31),  we  obtain 


A  =  i 


l-i 


(34) 


To  determine  the  rest  of  the  unknown  coefficients,  B  —  D,  we  insist  that  g'  has  the  required 
singularity  structure  in  (24)  at  2  =  zd-  If  we  rewrite  G(C)  as  a  function  of  2  and  perform 
Laurent  expansion  in  terms  of  2  around  Zd  and  compare  it  with  g\z )  in  (24),  we  obtain 


B  =  i 

C  = 


e2H 


42TV2’ 

m( 3e2  -  4zdz^  +  4zf) 


D  =  i  — 


8  Zd2 

M  (3e2  -  4 zdz^  -  4 zf) 

8 W2 


and  g0  in  (24), 


go  = 


3gzd  10e2g 


+ 


322d3 


+ 


3e2g 


82 


1/2 


d  +  zd1/2  I  Zd 

(-2 zdZd  +  6zd2  +  3e2)  JL 


Zd  ~  Zd 


4  (  ^/2  +  2d1 


1/2  V  I/2 


16  2 


1/2 


+  A,1/2)1 


AT 


(-2 2d2d  +  62d2  +  3e2)  /r 
16  ("2  V2  +  zj1/2)  2^5/2 


(35) 

(36) 

(37) 


(38) 


Similarly,  rewriting  F(Q  as  a  function  of  2  and  perform  Laurent  expansion  in  terms  of  2 
around  Zd  and  compare  with  f(z)  in  (24),  and  utilising  (34)  and  (35)-(37),  we  obtain  /o 
and  /1, 


/o  =  f - 

4  2d 


e2g 


+ 


(2 zdzd  -  2 zd2  -  3e2)  g 

V2 


4(4/2+51/2)  53/2  s(T2+^1/2) 

(-22rf2d  -  22d2  +  3e2)  p 


8  (  ZlJ2  +  Zd2/ 2)  2dS/2 


/l  = 


12^ 


V 


3  g 

~i  + 


9z3d/2e2Jl 


3zd/2  ~  2zd2  -3e2)  g 

*\/2  2j1/2N)  3  ~zd2 


2[z1d/2  +  z-d1/2)  z~d3/2 


2(4/2+^2)' 


32^/2  (-2 2d2d  -  22d2  +  3e2)  g 


+ 


4  2 


Zd1/2)  Zd5'2 


i/2  +  zdL,“)  Zd" 


(39) 


(40) 
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(a)  Trajectory  of  a  swimmer:  (xo,  Yo)  =  (10,  2)  (b)  Trajectory  of  a  swimmer:  (Xo,  Yo)  =  (0,  2) 


Trajectory  of  a  swimmer:  (Xq,  Yo)  =  (-1,  2)  M)  Trajectory  of  a  swimmer:  (Xo,  Yo)  =  (-15,  2) 


X 


Figure  4:  Examples  of  trajectories  from  the  initial  points  (xdo,  Udo )  marked  with  red  circles, 
each  with  200  different  initial  heading  directions  do-  ( Xdo,Vdo )  is  (a):(10,  2),  (b):(0,  2), 
(c):(— 1,2),  and  (d):(— 15,2). 

4  Results  of  numerical  simulations 


Results  of  numerical  simulations  using  the  governing  evolution  equations  for  a  treadmilling 
organism  (26)  and  (27)  are  shown  in  this  section.  The  time  integration  for  all  the  simulations 
here  were  performed  by  Matlab  ode45  solver  with  the  absolute  error  tolerance  10-6  and 
relative  error  tolerance  10~3.  The  radius  of  the  circular  body  of  the  treadmilling  organism 
e  is  set  to  be  unity,  giving  the  reference  length  scale  to  the  system.  For  initial  conditions 
(%do,ydo,Qo),  we  took  3237  different  ( xdo,ydo )  €  [-20,  1]  x  [1,  10]  ©  (1,  20]  x  [0,20]  every 
0.5  in  both  x  and  y  direction,  and  200  different  9q  €  [0,  27t)  every  7r/100. 
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(^Trajectory  of  a  swimmer:  (x0,  y0,  0o)  =  (-lO,  10, 3*/4) 


X 

(C)  Trajectory  of  a  swimmer:  (x0,  Yo,  0o)  =  (5, 0, 51^/50) 


X 

(e)  Trajectory  of  a  swimmer:  (x0,  Yo,  0o)  =  (lO,  10, 6*/5) 


-80  -70  -60  -50  -40  -30  -20  -10  0  10 


(b)  Trajectory  of  a  swimmer:  (x0,  Yo,  0o)  =  (-lO,  10, 18?t/25) 


X 

(d)  Trajectory  of  a  swimmer:  (x0,  Yo,  0o)  =  (5,5, 3*710) 


X 


X 


Figure  5:  Examples  of  trajectories  from  the  initial  points  ( Xdo,ydo )  marked  with  red  circles 
each,  with  a  certain  initial  heading  direction  9q.  Treadmilling  organism  can  escape  from 
the  wall,  (a)  and  (d),  be  above  the  wall  (b),  be  underneath  the  wall  (d),  or  crush  into  the 
wall  (e). 
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4.1  Various  trajectories 

By  solving  the  governing  equations  of  the  swimming  organism  (26)  and  (27)  utilising  (38)- 
(40),  we  obtain  many  different  trajectories  depending  upon  different  initial  conditions.  Fig¬ 
ure  5  shows  examples  of  such  trajectories.  At  sufficiently  large  time,  the  treadmilling 
organism  is  possible  to  in  any  quadrant  if  we  choose  appropriate  initial  conditions,  as  we 
can  see  in  Fig. 5  (a)-(d).  There  also  is  a  case  that  the  treadmilling  organism  crushes  into 
the  wall  in  finite  time  as  reported  in  [9] .  An  example  of  this  unphysical  trajectory  is  shown 
in  Fig. 5  (e),  and  will  be  discussed  in  §4.4.  Both  experimental  observations  and  preceding 
theoretical  studies  suggest  that,  when  there  is  a  no-slip  wall  near  a  treadmilling  organism, 
the  organism  tends  to  be  attracted  to  its  own  image  on  the  wall  and  move  towards  the  wall 
[6,  7,  10,  9,  12,  13,  9,  16,  17].  This  behaviour  is  clearly  seen  in  all  the  trajectories  shown 
in  Fig. 5  at  an  early  stage  of  time  integration.  Nevertheless,  in  the  cases  shown  in  Fig. 5  (a) 
and  (d),  the  treadmilling  organism  moves  away  from  the  wall  after  it  has  come  close  to  the 
edge  of  the  wall  once  .  Apparently,  there  are  several  cases  that  the  treadmilling  organism 
hangs  around/above/underneath  the  wall  as  shown  in  Fig. 5  (b)  and  (c).  This  is  consistent 
with  the  results  in  several  preceding  study  with  a  no-slip  wall [6,  7,  10,  9,  12,  13,  9,  16,  17]. 
In  these  case,  the  treadmilling  organism  shows  the  trajectories  bouncing  along  the  wall  as 
those  in  the  case  with  an  infinite  no-slip  wall [16],  but  the  bouncing  pattern  is,  unlike  the 
case  with  the  infinite  no-slip  wall,  irregular  and  sometimes  not  periodic  even  after  suffi¬ 
ciently  large  time  has  passed.  This  is  also  qualitatively  consistent  with  the  results  of  the 
study  with  a  no-slip  infinite  wall  with  a  gap  [17].  Note  that  the  treadmilling  organism  make 
changes  its  heading  direction  and  the  quality  of  its  trajectory  significantly  when  it  arrived 
in  the  vicinity  of  the  edge  of  the  wall,  where  the  image  of  the  organism  cannot  move  to 
the  positive  x-direction  any  more.  Note  also  that,  at  a  single  glance,  the  initial  heading 
direction  9 o  cannot  be  judged  from  the  trajectory  even  approximately.  This  is  a  both  nat¬ 
ural  and  striking  difference  from  the  case  with  an  infinite  wall  [16].  The  strong  asymmetry 
due  to  the  absence  of  the  wall  in  the  region  x  >  0  gives  a  great  influence  to  the  trajecto¬ 
ries,  and  even  the  trajectory  for  9q  doesn’t  make  a  straight  vertical  line.  This  feature  can 
be  clearly  seen  in  Fig. 4  in  which  200  different  trajectories  for  200  different  9q  for  several 
combination  of  ( Xdo,ydo )  are  shown.  In  the  cases  with  the  initial  point  of  the  organism  is 
close  to  or  above  the  wall  (Fig.4  (b)-(d)),  it  is  somewhat  possible  to  predict  the  direction 
that  the  treadmilling  organism  heads  for  at  an  early  stage,  but  when  the  initial  point  is  far 
from  the  wall  (Fig.4  (a)),  the  heading  direction  is  unpredictable  from  the  knowledge  of  the 
trajectories  for  the  cases  with  an  infinite  wall  in  [16].  However,  we  see  a  remarkable  line  on 
which  many  of  the  trajectories  with  different  9q  seem  to  converge  in  all  the  cases  in  Fig.4, 
especially  clearly  in  Fig.4  (a).  This  line  may  give  us  some  kind  of  criterion  in  speculating 
the  trajectory.  As  such  lines  seem  to  fall  upon  the  lines  which  connect  the  edge  of  the  wall 
x  =  y  =  0  and  the  initial  points  of  the  treadmilling  organism  (xdo,ydo)  which  is  marked  by 
red  circles  in  Fig.4,  here,  we  introduce  a  critical  linear  line  £c, 

£c{x)  =  —x.  (41) 

XdO 

Whether  or  not  £c  acts  as  a  critical  line  in  considering  the  direction  of  trajectory  in  the  long 
run  will  be  checked  in  §4.2.  Another  noticeable  feature  we  can  see  from  Fig.fig:AllThetas 
is  the  complexity  of  the  trajectories  for  initial  point  with  a  large  negative  x-coordinate. 
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Figure  6:  (a):  Five  regions.  The  thick  black  line  corresponds  to  the  wall,  (b):  An  example 
of  a  pie  chart.  The  different  colours  indicate  in  what  region  shown  in  the  panel  (a)  the 
treadmilling  organism  from  a  certain  initial  point  is  at  sufficiently  large  time.  Dotted  line 
corresponds  to  the  slope  of  the  critical  line  lc  for  the  considered  initial  point. 


As  already  seen  in  5  and  mentioned  above,  the  treadmilling  organism  changes  its  heading 
direction  and  the  quality  of  its  trajectory  greatly  when  it  comes  to  the  vicinity  of  the  edge 
of  the  wall,  x  =  y  =  0.  The  behaviour  of  the  organism  after  reaching  such  area  significantly 
depends  upon  (x<j,  yd,  8)  of  the  trajectory  at  the  time.  This  makes  the  trajectories  from  large 
negative  Xdo  unpredictable  and  complicated.  For  such  cases,  even  a  very  small  difference  in 
initial  condition  can  result  in  a  huge  difference  to  the  trajectory  of  the  treadmilling  organism, 
and  this  tendency  is  expected  to  become  more  distinct  when  the  initial  x-coordinate  x^ 
has  larger  negative  value.  We  will  come  back  this  point  in  §4.2  again. 

4.2  Escaping  probability 

Now,  we  classify  a  wide  variety  of  trajectories  in  terms  of  the  regions  in  which  the  tread¬ 
milling  organism  is  at  sufficiently  large  time.  For  this  purpose,  we  first  introduce  five  regions 
shown  in  Fig.6  (a).  Region  1  is  {(x,y)  E  (0.2e,  oo)  x  [0,0. 2e]  ©  (0,  oo)  x  (0.2e,  oo)},  and  is 
coloured  red  in  Fig.6  (a).  This  is  the  region  in  which  the  treadmilling  organism  that  escapes 
from  the  wall  to  y  >  0  direction  is.  Region  2  is  {(x,  y )  E  (— oo,  0]  x  (0.2e,  oo)},  and  is  coloured 
blue  in  Fig.6  (a).  This  is  the  region  in  which  the  treadmilling  organism  that  goes  back  to 
above  the  wall  is.  Region  3  is  {( x,y )  E  (— oo,0]  x  (— oo,  — 0.2e)},  and  is  coloured  orange  in 
Fig.6  (a).  This  is  the  region  in  which  the  treadmilling  organism  that  escapes  from  the  wall 
to  y  <  0  direction  is.  Region  4  is  {(x,  y)  E  (0.2e,  oo)  x  [— 0.2e,  0)  ©  (0,  oo)  x  (— oo,  0.2e)},  and 
is  coloured  green  in  Fig.6  (a).  This  is  the  region  in  which  the  treadmilling  organism  that 
goes  back  to  underneath  the  wall  is.  Then  Region  5  is  {(x,  y)  E  (— oo,0.2e]  x  [ — 0.2e,  0.2e]}, 
and  is  coloured  pink  in  Fig.6  (a).  This  is  to  describe  the  case  that  the  treadmilling  organism 
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Figure  7:  Pie  charts  at  t  =  1500  for  several  initial  points,  x  and  y  coordinates  of  the  point 
on  which  each  pie  chart  is  placed  and  correspond  to  the  initial  point  for  the  pie  chart.  Thick 
black  line  represents  the  wall. 


crushes  into  the  wall. 

Utilising  these  five  regions,  we  make  a  pie  chart  for  each  considering  initial  point.  The 
pie  chart  shows  in  what  region  the  treadmilling  organism  will  be  at  a  large  time,  here 
t  =  1500,  if  it  is  given  a  certain  initial  heading  direction  6q.  An  example  of  such  pie  chart  is 
given  in  Fig. 6  (b).  Each  colour  appear  in  this  pie  chart  corresponds  to  the  colour  given  for 
each  region  in  Fig. 6  (a).  The  circular  sector  where  the  angle  measured  from  the  solid  black 
arrow  is  in  [$o  —  7r/200,  +  7r / 200]  corresponds  to  the  initial  heading  direction  6$  of  the 
treadmilling  organism,  and  the  colour  there  indicate  the  region  in  which  the  treadmilling 
organism  is  at  sufficiently  large  time.  For  instance,  the  pie  chart  in  Fig. 6  (b)  shows  that 
the  treadmilling  organism  initially  placed  at  the  point  ( Xdo,Vdo )  =  (2,2)  and  given  the 
heading  direction  6q  =  3/4-7T  will  be  in  the  Region  1  at  time  t  =  1500.  The  dotted  black 
line  in  Fig. 6  (b)  corresponds  to  the  slope  of  the  critical  line  £c  (41)  for  the  considered  initial 
point  ( Xdo,ydo )  =  (2,2).  This  is  to  see  whether  or  not,  or  to  what  degree,  lc  can  act  as 
a  critical  line  in  predicting  the  asymptotic  position  of  the  treadmilling  organism.  For  the 
initial  position  ( Xdo,ydo )  =  (2,2),  we  can  see  that,  from  Fig. 6  (b),  the  dotted  line  fall  near 
on  the  boundary  between  the  blue(Region  2)  and  the  green(Region  4)  regions. 

Figure  7  shows  pie  charts  for  several  initial  points  (xdo,ydo)-  The  most  impressive  feature 
is  the  complex  city  of  the  pie  charts  for  the  initial  points  with  large  negative  x-coordinates. 
This  reflects  the  fact,  which  has  been  discussed  in  §4.1,  that  the  treadmilling  organism 
changes  its  heading  direction  and  the  quality  of  its  trajectory  significantly  when  it  has 


297 


come  to  the  vicinity  of  the  edge  of  the  wall,  x  =  y  =  0  depending  upon  its  (. x yd,  0)  at  the 
time,  and  so  even  a  very  small  difference  in  initial  condition  can  result  in  a  huge  difference  to 
its  trajectory  when  the  initial  x-coordinate  Xdo  has  large  negative  value.  Note  that,  however, 
Pie  charts  tend  to  become  rather  simpler  for  larger  ydo  for  any  fixed  xdo  •  One  more  thing  we 
can  easily  notice  is  that  there  is  some  limitation  for  the  region  that  the  treadmilling  organism 
is  in  at  t  =  1500  depending  upon  its  initial  position.  Broadly,  the  treadmilling  organism 
is  in  regions  1,2,  or  5  when  its  (xdo,ydo)  £  [ — 20, 0]  x  [1,10];  regions  1,2,4,  or  5  when  its 
(■ xdo,Vdo )  €  (0,20]  x  [1, 10];  regions  1,2,  and  4  when  xd0  €  (0,20],ydo  =  0.  The  possibility 
of  going  the  region  3  where  is  underneath  the  wall  that  the  cases  the  initial  y  position  yd o 
is  zero  is  natural  because  of  the  symmetry  about  y  =  0  the  they  have.  When  we  compare 
the  pie  charts  at  a  certain  x-coordinate,  it  is  apparent  that  the  pie  charts  become  simpler 
for  larger  yd o-  This  is  because  of  the  effect  of  asymmetry  that  the  wall  gives  to  the  system. 
Also,  although  all  the  pie  charts  almost  have  2-fold  rotational  symmetry,  desymmetrised 
part  becomes  obvious  when  the  initial  point  is  near  the  wall,  where  the  asymmetric  effect 
of  the  system  becomes  strong.  From  the  pie  charts  in  Fig. 7,  it  is  apparent  that  tc  does  not 
act  as  a  critical  line  as  we  expected  in  §4.1  in  any  way  in  considering  the  position  of  the 
treadmilling  organism  at  sufficiently  large  time,  at  least  at  t  =  1500.  However,  ellc  still 
falls  near  one  of  the  boundaries  of  two  different  regions  in  the  cases  with  positive  initial 
x-coordinate  xdo  >  0  and  in  the  cases  whose  initial  positions  are  far  from  the  wall. 

Now  let  us  consider  the  escaping  probability  Pe,  the  probability  that  the  treadmilling 
organism  can  escape  from  the  wall,  or  its  own  image  on  the  wall.  Although,  ideally,  this 
is  to  be  defined  by  means  of  the  positions  of  the  treadmilling  organism  at  t  — >•  oo,  here  we 
define  this  Pe  as  the  probability  of  the  treadmilling  organism’s  being  in  the  region  1  or  4 
at  sufficiently  large  time  t  =  ti,  he., 


Pe  = 


{#0q  |  €  regions  1  or  4 

Wo 


at  t  =  t;} 


(42) 


The  escaping  probability  Pe  for  ti  =  1500  for  different  initial  conditions  are  shown  in 
Fig. 8,  where  each  (x,  y)  coordinate  corresponds  to  each  considered  initial  position  of  the 
treadmilling  organism.  Pe  takes  the  value  0  <  Pe  <  1  for  almost  all  the  initial  positions  in 
Fig8;  Pe  is  zero  for  only  several  points  with  (x^o  <  0,ydo  =  0),  which  is  the  case  that  the 
treadmilling  organism  touches  the  wall  at  the  initial  time  and  rather  meaningless  in  this 
context,  and  there  is  no  point  where  Pe  is  unity.  This  suggests  that  treadmilling  organism 
has  the  possibility  of  both  escaping  from  the  wall  and  being  above /underneath  the  wall  for 
any  initial  position.  Even  when  the  treadmilling  organism  is  initially  placed  at  a  position  far 
from  the  wall,  it  feels  the  presence  of  its  own  image  on  the  wall  and  move  towards  the  wall 
if  its  initial  heading  direction  6 o  is  properly  chosen.  Similarly,  even  when  the  treadmilling 
organism  is  initially  placed  at  a  point  with  large  negative  x-coordinate,  above  the  wall  and 
far  from  the  edge  of  the  wall,  for  some  6q.  it  finally  moves  away  from  its  own  image  on  the 
wall.  The  distribution  and  the  monotonic  change  of  the  value  of  Pe  in  the  region  where 
x  >  0  seems  to  be  reasonable.  However,  in  the  region  where  x  <  0,  there  are  couple  of  lines 
which  has  larger  Pe  value  than  naive  expectation.  This  might  be  the  result  of  the  complex 
pie  charts  in  these  region,  but  the  reason  why  we  see  such  fairly  larger  Pe  value  on  the 
points  on  certain  lines  is  not  clear  and  needs  further  investigations. 
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Escaping  probability  PE  for  each  initial  point 


20 


y  10 


0.2  0.4  0.6  0.8 


Figure  8:  Escaping  probability  at  t  =  1500  for  different  initial  points.  Each  (x,  y )  coordinate 
and  the  thick  black  line  corresponds  to  the  considered  initial  position  of  the  treadmilling 
organism  and  the  wall,  respectively. 
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Figure  9:  Trajectory  of  the  temporal  change  of  the  speed  of  the  treadmilling  organism 
for  the  case  initial  condition  (xdo,ydo,@o)  =  (— 10, 10,  37t/4).  (a):  the  trajectory  (b):  the 
temporal  variation  of  the  y-coordinate  of  the  organism  z d  (blue  solid  line)  and  the  temporal 
variation  of  the  speed  of  the  organism  (green  solid  line). 

4.3  Temporal  change  of  the  speed  of  the  treadmilling  organism  in  region 
1  and  4 

For  the  definition  of  the  escaping  probability  Pe  in  §4.2,  we  have  considered  the  probability 
of  the  treadmilling  organism’s  being  in  the  region  1  or  4  at  large  time  t  =  t/.  Then  the 
following  questions  may  always  exit:  has  the  treadmilling  organism  surely  escaped  from  the 
wall  when  it  is  in  these  region  at  large  time?,  does  it  stop  if  it  reaches  a  point  sufficiently 
far  from  the  wall?  An  example  of  the  temporal  variation  of  the  speed  of  the  treadmilling 
organism  xd2  +  yd2  is  shown  in  Fig. 9.  When  the  treadmilling  organism  comes  in  the 
vicinity  of  the  edge  of  the  wall  and  changes  its  heading  direction,  the  speed  of  the  organism 
shows  a  steep  increase.  After  that  the  treadmilling  organism  moves  away  from  the  wall. 
Then,  as  the  distance  from  the  wall  increase,  the  moving  speed  of  the  organism  decreases. 
The  speed  is  10  2  at  t  103  and  keep  decreasing.  This  is  because  that  the  farther  away 
from  the  wall  the  treadmilling  organism  goes,  the  effect  of  its  image  on  the  wall  becomes 
weaker  since  the  image  cannot  follow  the  organism  any  further  than  the  edge  of  the  wall. 
Nevertheless,  we  cannot  be  perfectly  sure  whether  or  not  the  speed  of  the  treadmilling 
organism  approaches  zero  as  t  — >•  oo,  and  whether  or  not  the  treadmilling  organism  has 
really  escaped  from  the  wall  in  this  case,  only  from  Fig. 9.  To  confirm  these  points,  we  need 
a  longer  time  integration  with  much  higher  accuracy;  this  should  be  a  further  work. 

4.4  Velocity  field 

Fig.  10  shows  the  velocity  field  calculated  from  the  two  Goursat  functions  when  the  tread¬ 
milling  organism  is  placed  at  the  point  (x,y)  =  (2,2).  It  can  be  confirmed  from  Fig. 10(a) 
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(a)  Velocity  field,  Ux(x,  y)  (b)  Velocity  field,  uy(x,  y) 


Figure  10:  Velocity  field  when  the  treadmilling  organism  is  at  the  point  (x,y)  =  (2,2). 
(a)  and  (b)(a):  x-velocity  ux(x.  y)  (b)  :  y-velocity  uy(x,y )  (c):  x-velocity  at  y  =  2 

ux(x,y  =  2)  (d):  y-velocity  at  x  =  2  uy(x  =  2 ,y).  The  darkness  of  colour  represents  the 
value  of  the  velocity,  the  thin  and  the  thick  solid  line  are  contour  of  the  velocity  and  the 
no-slip  wall,  respectively. 
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and  (b)  that  the  boundary  condition  on  the  no-slip  wall  is  satisfied,  and  the  velocity  di¬ 
verges  at  the  point  (x,y)  =  (2,2)  where  the  singularity,  which  represents  the  treadmilling 
organism,  is  placed.  The  boundary  condition  on  the  surface  of  the  treadmilling  organism, 
on  the  contrary,  is  not  satisfied;  the  contours  of  the  both  x-  and  y-  velocity  are  not  circles 
around  the  treadmilling  organism.  This  is  because  the  model  used  in  this  project  is  very 
simple  and  has  failed  to  capture  all  the  features  of  the  treadmilling  organism  we  need  to 
consider.  This  may  affect  the  behaviour  of  the  treadmilling  organism,  especially  when  it 
is  near  the  wall.  This  may  be  one  of  the  reasons  why  we  had  unphysical  trajectories  such 
as  the  treadmilling  organism  crushes  into  the  wall.  Nevertheless,  at  least  for  the  case  that 
the  treadmilling  organism  bounces  back  on  the  wall,  we  have  confirmed  that  the  qualita¬ 
tive  behaviour  of  the  organism  does  not  change  even  when  the  model  which  shows  better 
boundary  condition.  This  suggests  that  it  may  probably  be  possible  to  consider  that  the 
trajectories,  apart  from  those  that  the  treadmilling  organism  crushes  into  the  wall,  we  have 
found  in  this  report  are  reasonable. 

5  Discussion  and  conclusions 

The  governing  evolution  equations  for  a  treadmilling  organism  when  there  is  a  half-infinite 
non-slip  wall  in  the  considered  unbounded  domain  have  been  derived,  by  means  of  an 
image  system  method,  in  §3.  The  treadmilling  organism  was  assumed  to  have  a  circle 
body  of  radius  e  and  a  moving  centre  Zd(t)  =  Xd(t)  +  iyd(t )  in  such  a  domain.  It  has  been 
modelled  as  an  equivalent  point  singularity  description  with  stresslet  of  strength  y  and  the 
quadrupole  of  strength  2e2y  at  Zd  in  Stokes  flow.  The  existence  of  the  wall  of  an  infinitesimal 
thickness  causes  a  critical  problem  when  an  image  system  method  is  employed;  it  is  not 
straightforward  to  determine  where  to  place  the  image  of  the  treadmilling  organism  on  the 
wall.  To  avoid  this  problem,  a  conformal  mapping  Q  —  iz1/2  was  employed,  which  maps 
{z  €  C}  to  {£  |  Im(C)  >  0},  especially,  the  half-infinite  wall  in  2-space  to  the  infinite  wall  in 
((-space.  Then  Zd  and  its  image  were  defined  through  Qd-,  the  image  of  Cd  on  the  infinite  wall 
in  ((-space  overline(d ,  and  the  conformal  mapping.  By  imposing  two  boundary  conditions, 
one  on  the  body  of  the  organism  and  the  other  on  the  no-slip  wall,  the  coefficients  of  two 
Goursat  functions  f(z,t )  and  g(z,t),  which  gives  the  stream  function  of  the  flow,  were 
determined.  The  governing  evolution  equations  of  the  position  and  the  heading  direction  of 
the  treadmilling  organism,  Zd  and  9d,  were  obtained  by  taking  the  analytic  part  of  the  flow 
velocity  and  the  half  of  the  analytic  part  of  the  vorticity  of  the  flow,  respectively. 

The  trajectories  of  the  treadmilling  organism  were  numerically  calculated  in  §4.  The 
different  trajectories  of  the  treadmilling  organism  depending  upon  different  initial  conditions 
were  discussed  in  §4.1.  The  treadmilling  organism  was  attracted  to  its  image  on  the  wall, 
as  suggested  in  previous  experimental  and  theoretical  studies  [6,  7,  10,  9,  12,  13,  9,  16,  17], 
at  an  early  stage  of  the  time  integration.  Nevertheless,  in  some  cases,  the  treadmilling 
organism  moved  away  from  the  wall  after  it  had  come  to  the  vicinity  of  the  edge  of  the  wall 
once.  To  investigate  this  behaviour  further,  in  the  first  part  of  §4.2,  we  have  divided  R2 
domain  into  five  regions  and  made  pie  charts  which  show  in  what  region  the  treadmilling 
organism  is  at  sufficiently  large  time.  The  pie  charts  for  different  initial  positions  provided 
in  §4.2,  demonstrated  that  those  for  the  initial  points  with  larger  negative  x-coordinates 
have  more  complex  structures  compared  to  those  with  smaller  negative  x-coordinates.  This 
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reflects  the  fact  that  the  treadmilling  organism  changes  in  its  heading  direction  and  the 
quality  of  its  trajectory  significantly  when  it  comes  to  the  vicinity  of  the  edge  of  the  wall, 
x  =  y  =  0  depending  upon  its  (xd,  Vd,  9)  at  the  time,  and  that  even  a  very  small  difference 
in  initial  condition  can  make  a  huge  difference  to  its  trajectory  when  the  initial  x-coordinate 
Xdo  has  larger  negative  value  as  seen  in  §4.1.  We  further  defined  and  numerically  calculated 
the  escaping  probability  Pe,  the  probability  that  the  treadmilling  organism  can  escape  from 
the  wall,  or  its  image  on  the  wall  in  the  latter  part  of  §4.1.  This  Pe  did  not  take  0  or  1 
value  for  any  physically  reasonable  initial  point.  This  suggests  that  treadmilling  organism 
has  the  possibility  of  both  escaping  from  the  wall  and  being  above / underneath  the  wall  for 
any  initial  position.  Even  when  the  treadmilling  organism  is  initially  placed  at  a  position 
far  from  the  wall,  it  feels  the  presence  of  its  image  on  the  wall  and  move  towards  the  wall 
if  its  own  initial  heading  direction  9q  is  appropriately  chosen.  Similarly,  even  when  the 
treadmilling  organism  is  initially  placed  at  a  point  with  large  negative  x-coordinate,  above 
the  wall  and  far  from  the  edge  of  the  wall,  for  some  Oq,  it  can  finally  move  away  from  its  own 
image  on  the  wall.  When  the  treadmilling  organism  moves  away  from  the  wall,  its  speed 
decreases  as  the  distance  from  the  wall  increases  as  discussed  in  §4.3.  This  is  because  that 
the  treadmilling  organism  feels  less  effect  of  its  own  image  on  the  wall  as  it  moves  farther 
from  the  wall,  since  the  image  cannot  follow  the  organism  any  further  than  the  edge  of  the 
wall. 

The  results  in  this  project  seem  reasonable  in  some  respects.  However,  the  obtained 
Pe  does  not  take  0  or  1  for  any  physically  reasonable  initial  point,  even  with  large  posi¬ 
tive/negative  x-coordinate.  This  may  suggest  that  the  combination  of  the  singularities  used 
required  some  modifications  when  the  treadmilling  organism  was  far  from  the  edge  of  the 
wall.  The  situation  with  a  half-infinite  no-slip  wall  in  an  unbounded  domain  corresponds 
to  the  case  in  which  the  no-slip  wall  has  a  gap  as  studied  in  [17]  but  of  an  infinite  width. 
Checking  whether  or  not  the  results  gained  in  this  report  and  those  for  the  case  with  no¬ 
slip  wall  with  a  gap  in  the  limit  of  width  of  the  gap  — >•  oo  may  help  us  gain  some  further 
understanding  of  this  problem.  The  behaviour  of  a  treadmilling  organism  near  a  90°  no-slip 
corner  or  even  a  no-slip  corner  with  arbitrary  angle  in  an  unbounded  domain  will  be  our 
next  problem  to  investigate.  These  will  be  an  important  step  to  clarifying  how  a  tread¬ 
milling  organism  behaves  and  the  effect  of  its  mixing  when  it  is  in  a  bounded  domain  with 
no-slip  boundaries  of  complicated  forms. 
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1  Introduction 

The  mixing  of  tracers  on  the  surface  of  a  sphere  has  many  geophysical  applications  from  the 
advection  of  volcanic  ash  to  the  global  redistribution  of  temperature  and  angular  momentum 
by  atmospheric  general  circulation.  Due  to  the  difficulties  of  spherical  geometry,  such  as 
the  poles  and  the  isotropy  between  the  meridional  and  zonal  directions,  previous  work  on 
optimal  mixing  focuses  instead  on  the  d-dimensional  torus  [5,  7,  8].  Here  we  are  motivated 
by  the  geophysical  problem  of  temperature  redistribution  over  the  surface  of  the  sphere 
given  a  heat  source  at  the  equator  and  cooling  at  the  poles.  What  flow  most  efficiently 
mixes  the  temperature  field  and  how  do  the  characteristics  of  these  optimal  flows  compare 
to  similar  flows  on  the  torus? 

Before  describing  the  properties  of  optimal  flows,  we  must  first  define  a  measure  of 
mixing  efficiency  that  will  then  be  used  to  define  a  flow  that  most  optimally  mixes  a  tracer 
field.  One  measure  of  mixing  is  the  variance  of  the  tracer  concentration.  If  we  consider  the 
time  evolution  of  a  tracer  field  with  zero  mean  subject  to  advective  stirring  and  diffusivity, 
the  tracer  concentration  8  obeys  the  advection-diffusion  equation 

30 

—  +  u-V0  -  kA0  =  0  (1) 

at 

where  u  is  the  advective  velocity  (independent  of  the  tracer  concentration  6)  and  where  k 
is  the  molecular  diffusivity  coefficient.  Multiplying  (1)  by  8  and  averaging  over  the  domain 
yields  the  time  rate  of  change  of  the  variance  of  8  given  by 

=  -2k(\\78\ 2>  (2) 

where  the  integral  over  the  domain  V  is  denoted  by 

(9)  =  ±fv8dV.  (3) 

From  (3)  we  see  that  the  variance  of  the  tracer  concentration  (|0|2)  decreases  monotonically 
as  a  function  of  the  molecular  diffusivity  k  and  the  magnitude  of  the  gradients  of  the  tracer 
field.  As  both  k  and  (|V0|2)  are  positive  definite,  the  variance  of  the  tracer  tends  to  a 
constant  as  the  gradients  of  the  tracer  field  tend  to  zero  in  the  absence  of  sources  or  sinks 
of  the  tracer.  In  other  words,  diffusion  acts  to  destroy  gradients  in  the  tracer  field  and 
homogenize  8. 
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Diffusivity  can  reduce  the  variance  of  a  tracer  field  in  the  absence  of  stirring,  however 
this  process  of  diffusive  mixing  is  quite  inefficient.  We  can  significantly  enhance  the  process 
of  mixing  and  homogenization  by  including  a  stirring  flow  u.  The  mechanical  act  of  stir¬ 
ring  stretches  and  folds  material  lines  of  the  tracer  field,  enhances  the  gradients  of  8,  and 
ultimately  leads  to  a  more  rapid  homogenization  of  the  tracer  field. 

If  there  are  no  source  terms  to  maintain  gradients  in  8  diffusivity  will  lead  to  a  spatially 
homogeneous  tracer  field  over  time.  We  can  obtain  a  nonzero  (|0|2)  as  t  — >  oo  by  including 
a  spatially  varying  tracer  source  term  in  (1), 

38 

—  +  u-V8  -  kA8  =  ,s{x).  (4) 

The  homogenization  of  the  tracer  field  by  mixing  can  then  balance  the  production  of  tracer 
variance  by  the  source  in  steady  state.  It  is  within  this  steady  state  that  we  will  evaluate 
the  effect  of  stirring  on  the  mixing  process. 

As  we  expect  stirring  to  enhance  mixing,  we  anticipate  that  for  the  same  value  of  k 
the  variance  of  a  stirred  tracer  field  will  be  less  than  that  of  an  unstirred  tracer  in  steady 
state.  Thiffeault  et  al.  [7]  measure  the  relative  mixing  enhancement  due  to  stirring  through 
a  measure  given  by  ratio  of  the  unstirred  to  stirred  steady  state  variances, 


where  8q  satisfies  the  unstirred,  steady  state  diffusion  equation 

—k  A8o  =  s(x )  (6) 

and  where  8  satisfies  the  stirred,  steady  state  advection-diffusion  equation 

u  ■  V#  —  kA6  =  s(x).  (7) 

A  mixing  efficiency  of  8  =  1  would  imply  that  the  velocity  field  does  not  enhance  mixing 
relative  to  diffusive  processes,  as  would  be  the  case  if  u  were  parallel  to  lines  of  constant 
8  such  that  the  advective  term  u  ■  X78  of  (4)  vanished.  Mixing  efficiencies  of  8  <  1  will  be 
obtained  if  u  enhances  gradients  of  8  in  steady  state.  Such  small  mixing  efficiencies  are 
counterintuitive  if  we  assume  that  stirring  tends  to  improve  the  homogenization  of  a  tracer 
with  time,  as  is  the  case  for  ffows  with  mixing  efficiencies  of  8  >  1. 

By  defining  the  mixing  efficiency  factor  8  we  can  then  create  an  optimization  problem 
to  determine  the  most  efficient  ffow  fields  over  the  surface  of  the  sphere  by  maximizing  8. 
Many  approaches  have  been  taken  to  this  problem  in  the  past,  however  all  have  focused 
on  flows  over  the  torus.  There  are  two  primary  methods  of  maximizing  8\  finding  the  ffow 
that  most  efficiently  mixes  a  tracer  field  for  a  given  source  function  or,  alternatively,  finding 
the  source  function  that  is  most  efficiently  mixed  by  a  fixed  velocity  field.  Thiffeault  and 
Pavliotis  [8]  consider  the  problem  of  optimizing  the  source  function  for  a  fixed  velocity  field 
and  find  that  these  sources  tend  to  have  maxima  and  minima  located  near  in  regions  of 
strongest  velocity  and  are  aligned  such  that  the  flows  transport  sources  over  sinks. 

Alternatively,  Lin  et  al.  [3]  and  Thiffeault  [6]  maximize  the  mixing  efficiency  for  a  fixed 
source  by  varying  the  velocity  field.  Lin  et  al.  [3]  determine  the  velocity  field  that  most 


307 


quickly  reduces  a  measure  of  mixing  similar  to  £  in  the  absence  of  diffusivity.  In  contrast 
to  the  transient  problem  considered  by  Lin  et  al.  [3],  Thiffeault  [6]  introduce  a  means  of 
solving  for  the  velocity  fields  that  most  efficiently  mix  a  tracer  concentration  maintained 
by  a  spatially  varying  source  term  and  subject  to  molecular  diffusivity. 

Both  Lin  et  al.  [3]  and  Thiffeault  [6]  find  optimal  velocity  fields  on  the  torus.  Here  we 
will  expand  on  the  approach  of  Thiffeault  [6]  to  consider  optimal  flows  over  the  surface  of 
the  sphere.  We  choose  to  focus  on  three  points  of  comparisons  between  optimal  flows  on 
the  torus  and  the  sphere:  the  issue  of  upper  and  lower  bounds  on  the  mixing  efficiencies, 
the  structure  of  the  most  optimal  flow  configurations,  and  the  scaling  of  the  efficiencies  in 
the  Peclet  number. 

In  Section  2  we  describe  the  numerical  model  we  will  use  to  find  velocity  fields  that 
maximize  £.  We  then  compare  the  analytical  lower  bound  of  the  mixing  efficiencies  on  the 
torus  to  that  of  the  sphere  in  Section  3  and  see  if  the  efficiencies  scale  as  a  function  of  the 
Peclet  number  on  the  sphere  as  they  do  on  the  torus  in  Section  4.  We  will  then  discuss 
the  possibility  of  an  efficient  ‘sweeping’  flow  for  the  sphere  in  Section  5.  In  Section  6  we 
consider  future  directions  of  this  work. 

2  The  Model 

2.1  Euler-Lagrange  Equations 

We  maximize  £  in  a  spherical  geometry  by  varying  the  advective  velocity  field  while  holding 
constant  the  structure  of  the  source  function  and  the  diffusivity.  The  mixing  efficiency  £ 
as  given  in  (5)  has  two  components:  the  variance  of  the  unstirred  steady  state  tracer  field 
(|0O|2>  and  the  variance  of  the  stirred  steady  state  tracer  field  (|#j2).  These  tracer  fields  can 
be  written  as 


9  =  £~1s  6q  =  £q  (8) 

where  C  and  Co  are  the  advection-diffusion  and  diffusion  operators  given  by 

C  =  u  •  V  —  kA  Cq  =  —kA.  (9) 

From  (8)  and  (9),  we  see  that  the  solution  9q  to  the  diffusion-only  differential  equation 
will  be  constant  under  a  change  in  u.  Changing  u  will  only  influence  the  steady  state 
stirred  tracer  field  9  such  that  maximizing  £  is  equivalent  to  minimizing  (|0|2).  In  order  to 
minimize  (|#|2)  with  respect  to  u ,  we  first  define  the  variation  of  (|0|2)  following  Thiffeault 
[6] 

5(|@|2)  =  <5(|£-1s|2)  (10) 

where  we  have  used  9  =  £_1s  from  (8). 

As  we  are  holding  the  source  function  constant,  only  the  integral  operator  £_1  changes 
under  a  change  in  u  such  that 


<5(|0|2)  =  2  {C-1sSC~1s) 


(11) 
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which  can  be  simplified  using  the  identity  5C  1  =  —  C  l5CL  1, 

(5(|0|2)  =  -2(£-1s£-15££-1s).  (12) 

After  integrating  (12)  by  parts,  <S(|0|2}  can  be  written  in  terms  of  the  self-adjoint  operator 

A  =  CC\  (13) 

where  C)  =  —u  •  V  —  kS  is  the  adjoint  of  the  advection-diffusion  operator,  such  that 

<5(|0|2)  =  -2  (A^sdCC-h).  (14) 

As  the  variation  in  the  advection  diffusion  operator  only  depends  on  the  variation  of  the 
velocity  field,  5C  =  5u  ■  V,  we  can  write  the  variation  of  (|#|2)  in  terms  of  5u , 

<S(|0|2}  =  -2(A'1s5w-  VT”1*).  (15) 

The  functional  derivative  of  (|0|2)  with  respect  to  u  then  is 

— —  =  -2A~1sV£”1.s.  (16) 

ou 

We  could  find  the  extrema  of  (|#|2)  by  finding  the  zeroes  of  (16),  however  this  would 
tell  us  little  about  the  structure  of  the  most  efficient  flows  without  additional  constraints. 
We  place  a  constraint  on  the  energy  and  restrict  our  attention  to  non-divergent  flows  by 
creating  an  extended  functional  of  (|0|2) 

T(u)  =  (I e\ 2)  +  7  «M2>  -  u 2)  +  (I/V  •  u)  (17) 

whose  variation  with  respect  to  u  is 

AFpu]  _  _^-igy£-ig  4.  ryU  _  Vi/  =  0.  (18) 

ou 

Equation  (18)  is  the  Euler-Lagrange  equation  for  the  constrained  optimization  problem 
where  7  and  u  are  the  Lagrange  multipliers  for  the  energy  and  non-divergence  constraints, 
respectively.  Flows  that  satisfy  these  constraints  have  a  kinetic  energy  (|w|2)  equal  to  a 
constant  value  U2  and  are  non-divergent  such  that  V  •  u  =  0. 

2.2  Numerical  Model 

There  are  numerous  methods  of  finding  velocity  fields  that  maximize  £  and  minimize  (|0|2). 
Thiffeault  [6],  for  example,  suggests  a  method  to  directly  solve  the  Euler-Lagrange  equations 
given  in  (18)  for  the  velocity  field.  Here  we  consider  an  alternate  means  of  finding  the 
minima  of  (|0|2).  We  perform  a  two-part  optimization  process  based  on  Matlab’s  built-in 
nonlinear  optimization  routine  fmincon.  This  routine,  a  part  of  Matlab’s  Optimization 
Toolbox,  minimizes  (|0|2)  subject  to  the  non-divergence  and  energy  constraints. 

The  fmincon  routine  minimizes  (|#|2)  using  the  iterative  interior  point  algorithm.  At 
each  iteration  of  u  and  £[tt],  we  must  solve  the  advection-diffusion  equation  (8)  for  9.  For 
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the  torus  problem,  we  can  solve  (8)  for  6  by  generating  an  invertible,  spectral  advection- 
diffusion  operator  matrix  C\v\.  The  tracer  concentration  6  is  then  obtained  by  solving  the 
system  of  linear  equations  using  this  non-singular  operator.  Due  to  time  limits,  we  were 
unable  to  generate  a  non-singular,  spectral  advection-diffusion  operator  for  the  sphere.  In 
order  to  find  optimal  solutions,  we  instead  solve  (8)  for  6  using  f  solve  where  f  solve  is 
a  built-in  Matlab  routine  that  solves  a  system  of  nonlinear  equations.  Resorting  to  f  solve 
leads  to  an  increased  computational  cost  in  the  spherical  problem  relative  to  the  torus 
problem. 

In  addition  to  solving  for  6  using  f  solve  for  varying  u,  we  must  consider  that  not  all 
variations  in  u  are  permitted  due  to  constraints.  The  optimization  problem  is  constrained 
in  two  ways:  the  kinetic  energy  of  the  flow  is  held  fixed  at  a  constant  value  and  the  flow 
field  must  be  non-divergent.  The  energy  constraint  limits  the  complexity  and  strength  of 
the  flow  such  that  the  optimal  flow  field  u  in  steady  state  satisfies 

(\u\2)  =  U2  (19) 


where  U  is  a  constant  equal  to  unity  for  all  optimal  and  sub-optinral  flows  considered  here. 

The  energy  constraint  (19),  along  with  the  coefficient  of  diffusivity  k  and  the  radius  of 
the  sphere  a,  allow  us  to  calculate  the  Peclet  number  for  a  given  flow, 


(20) 


where  Pe  is  a  nondimensional  parameter  relating  the  relative  magnitude  of  the  advective 
stirring  to  the  molecular  diffusivity  in  the  advection-diffusion  equation  (1)  such  that  mixing 
processes  with  large  Pe  are  advection-dominated.  The  radius  of  the  sphere  is  held  fixed  at 
a  =  1  so  that  the  Peclet  number  of  all  experiments  here  is  a  function  of  the  diffusivity  only. 

The  non-divergence  constraint  V  •  u  can  be  satisfied  by  defining  a  streamfunction  if) 
such  that  the  zonal  and  meridional  velocities  over  the  sphere,  u  and  v  respectively,  satisfy 


1 

a  d(j) 


1  d'lp 
a  cos  <f>  d\ 


(21) 


where  —n  <  4>  <  it  is  the  latitude  and  0  <  A  <  27r  is  the  longitude.  It  should  be  mentioned 
that  we  are  using  a  geophysical  coordinate  system  where  4>  =  0  is  at  the  equator  rather  than 
standard  spherical  coordinates  where  the  polar  angle  is  typically  zero  at  the  north  pole. 

As  we  are  interested  in  flows  over  the  surface  of  a  thin  spherical  shell,  it  is  useful  to 
recast  all  operators  and  variables  in  terms  of  spherical  coordinates.  We  use  a  pseudo-spectral 
approach  to  calculating  the  advection-diffusion  operator  C.  Transformations  between  grid 
space  and  spherical  harmonics  in  addition  to  the  calculation  of  derivatives  in  spherical 
coordinates  are  performed  using  Algorithm  888,  a  package  of  Matlab  routines  for  spherical 
harmonic  functions  [1].  Using  Algorithm  888,  we  cast  all  scalars  in  terms  of  their  spherical 
harmonics, 


nwiTEWM  <22> 

n=0  m=—n 
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Figure  1:  The  geophysical  source  in  arbitrary  units,  with  heating  (red)  at  the  equator  and 
cooling  (blue)  at  the  poles. 


where  6  is  the  spectral  coefficient  of  8,  n  is  the  total  wavenumber,  m  is  the  zonal  wavenum¬ 
ber,  and  Y™(\,  n)  is  the  spherical  harmonic  of  longitude  A  and  of  y  =  sin</>.  The  spherical 
harmonic  is  given  by 


Y™(\,y)  =  eimXP™(y). 


where  P™{y)  are  the  associated  Legendre  functions  defined  by 


(23) 


pm 
r  n 


(v)  = 


1  / (2 n  +  1)  (n  —  m)\ 


2  (to  +  m) ! 


(1  -  A*2) 


m/2 


^n-\-m 

dyn+m 


-  1)’ 


(24) 


The  wavenumbers  n  and  m  describe  the  structure  of  the  scalar  field  in  latitude  and 
longitude.  In  the  zonal  direction,  m  is  similar  to  the  x-wavenumber  on  the  torus  and  is 
simply  a  superposition  of  Fourier  modes.  The  geometries  of  the  torus  and  the  sphere, 
however,  differ  significantly  in  the  meridional  direction.  Instead  of  a  y— wavenumber  as  on 
the  torus,  spherical  harmonics  are  cast  in  terms  of  a  total  wavenumber  n  where  n  —  m  are 
the  number  of  nodes  in  the  meridional  direction. 

The  geophysical  source  function  we  consider  here  is  given  by  the  n  =  2,  m  =  0  spherical 
harmonic  with  a  spectral  amplitude  of  —1.  The  source  function  can  then  be  written  as 

4  A,  </>)  =  (— 1)T2°(A,  n)  =  -P2°(m).  (25) 
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As  n  —  m  =  2  and  m  =  0  for  the  source  function,  s(A,  <f>)  has  two  nodes  in  latitude  located 
in  the  midlatitudes  and  has  no  zonal  structure.  The  amplitude  is  chosen  so  that  there  is  a 
source  of  ‘heat’  at  the  equator  and  a  sink  at  the  poles,  as  shown  in  Figure  1. 

2.3  Efficiencies  of  Sub-Optimai  Fiows 

The  fmincon  routine  is  an  iterative  process  that  requires  an  initial  velocity  field  sufficiently 
close  to  the  optimal  solution  in  order  to  converge  on  a  minima  in  (|#|2)  with  respect  to  u. 
Fmincon  can  only  identify  local  minima  and,  at  this  time,  we  are  unable  to  state  whether  or 
not  these  optimal  solutions  minimize  (|0|2)  globally.  The  solution  space  is  littered  with  local 
minima  such  that  a  ‘good’  choice  of  an  initial  velocity  field  is  crucial  for  both  convergence 
and  for  the  best  chance  at  finding  a  true  optimal  velocity  field  u. 

In  order  to  determine  what  constitutes  a  ‘good’  initial  guess,  we  begin  our  investiga¬ 
tion  of  optimal  flows  by  determining  the  efficiencies  of  the  sub-optimal  flows  generated  by 
monochromatic  streamfunctions.  Thiffeault  [6]  note  that  the  velocity  fields  that  most  effi¬ 
ciently  mix  a  tracer  in  the  presence  tends  to  be  flows  that  directly  transport  the  tracer  from 
regions  of  excess  8  (‘hot’  regions,  if  we  imagine  the  tracer  is  temperature)  to  sink  (‘cold’) 
regions. 

For  the  geophysical  source  in  Figure  1  and  following  the  observations  of  optimized  source 
functions  in  Thiffeault  and  Pavliotis  [8],  the  most  optimal  flow  would  likely  be  cellular  with 
the  strongest  velocities  over  the  maxima  or  minima  of  the  source  and  nodes  located  near 
the  mean  value  of  s(x).  The  geophysical  source  function  achieves  its  mean  value  in  the 
midlatitudes  such  that  we  would  expect 

1  dtp  1  dip 
a  dcp'  a  cos  (p  <9  A 

in  these  regions.  Using  (21),  the  streamfunction  ip  of  this  efficient  flow  would  then  likely 
achieve  its  extrema  in  the  midlatitudes  and  have  an  inflection  point  near  the  equator.  In 
terms  of  spherical  harmonics,  a  streamfunction  with  wavenumbers  satisfying  n  —  m  =  1, 
such  as  the  monochromatic  spherical  harmonic  functions  K,'  and  P32,  are  good  choices  for 
'ip. 

Indeed,  ip  oc  is  the  most  efficient  sub-optimal  streamfunction  at  Pe  =  10  over  the 
range  of  monochormatic  streamfunctions  shown  in  Figure  2.  Streamfunctions  proportional 
to  spherical  harmonic  modes  Y™  with  n—m  =  1  tend  to  have  the  highest  mixing  efficiencies, 
as  expected,  while  those  streamfunctions  with  n  —  m  =  0  (inter-hemispheric  flows)  are 
slightly  less  efficient.  In  both  the  n  —  m  =  1  and  n  —  m  =  0  flows,  stirring  enhances  mixing 
such  that  £  >  1.  Velocity  fields  described  by  streamfunctions  with  less  zonal  structure,  or 
with  smaller  m,  tend  to  be  more  efficient  than  those  with  larger  values  of  m,  with  an  optima 
in  £  around  m  =  2  for  both  n  —  m  =  1  and  n  —  m  =  0.  Good  initial  conditions  for  ip  in  the 
optimization  algorithm  would  then  be  the  monochromatic  streamfunctions  proportional  to 
V32  and  Y], 

One  surprising  conclusion  from  Figure  2  is  that  some  flows  with  n  —  m  =  2  are  in  fact 
inefficient.  For  these  flow  fields,  stirring  actually  increases  (|#|2)  relative  to  the  unstirred 
solution  <|6»0|2)  resulting  in  £  <  1.  It  is  clear  that  such  flow  fields  would  not  be  good  initial 
guesses  for  the  optimization  problem  as  stirring  acts  to  increase  the  variance  of  the  steady 
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Ylj  SPHERE:  Dependence  on  Flow  Structure 
Pe  =  10 


Figure  2:  Sub-optimal  efficiencies  £  for  streamfunctions  of  the  form  oc  stirring 

the  geophysical  source  field  with  n  —  m  =  0  (blue),  n  —  m  =  1  (green),  and  n  —  m  =  2  (red) 
for  zonal  wavenumbers  m  =  0  to  10  at  Pe  =  10. 

state  tracer  field.  These  inefficient  flows  are  a  unique  feature  of  spherical  geometry  and  of 
our  choice  of  steady,  time- independent  velocity  fields  as  will  be  shown  in  Section  3. 

3  Lower  Bounds  and  Inefficient  Stirring 

In  Section  2.3  we  determined  that  not  all  stirring  by  a  velocity  field  u  enhances  diffusive 
mixing.  We  obtained  a  range  of  sub-optimal  efficiencies  spanning  both  both  positive  and 
negative  values  of  £.  What  is  the  range  of  possible  £  for  the  sphere  and  how  does  this 
compare  to  that  of  the  torus?  We  will  answer  this  in  two  parts,  focusing  on  the  lower 
bounds  and  the  existence  of  inefficient  flows  here  and  on  the  scaling  of  the  most  efficient 
flows  in  Section  4. 

Previous  work  by  Shaw  et  al.  [5]  obtained  analytic  upper  and  lower  bounds  on  £  for  the 
torus  by  optimizing  over  the  tracer  field  9.  For  tracer  fields  stirred  by  a  uniform  velocity 
field,  where  X7u(x,  t)  =  0  at  each  instant  in  time,  and  supplied  by  a  monochromatic  source, 
Shaw  et  al.  [5]  find  that  the  mixing  efficiencies  are  bounded  by 

1  <  £2  <  1  +  Pe2/dL2k2s  (27) 

where  d  is  the  dimension  of  the  torus,  L  is  the  domain  size,  and  ks  is  the  wavenumber  of 
the  source.  The  Peclet  number  for  the  torus  is  the  same  as  that  of  the  sphere  given  in  (20) 
with  the  domain  size  L  being  the  characteristic  length  in  the  place  of  the  radius  a. 
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Eft  =  0.75234 

SPHERE:  onemode  ex  =  2£ 


(a)  Efficient  Flow,  £  =  2.44 


(b)  Inefficient  Flow,  £  =  0.75 


Figure  3:  Two  of  the  sub-optimal  flows  associated  with  the  parameter  sweep  in  Figure  2 
where  Panel  3(a)  is  associated  with  the  relatively  efficient  ffow  of  mode  m  =  2,  n  =  3  and 
where  Panel  3(b)  is  the  inefficient  ffow  of  mode  m  =  1,  n  =  3.  The  streamfunction  of  the 
velocity  field  are  plotted  on  top  of  colored  contours  of  the  steady-state  tracer  field  0. 


Shaw  et  al.  [5]  found  that  inefficient  flows  do  not  exist  on  the  torus  for  monochromatic 
sources,  such  as  the  geophysical  source  function  s(x)  oc  Ylj^A,  ji)  that  we  consider  here. 
For  such  monochromatic  sources,  stirring  over  the  torus  always  acts  to  decrease  the  (|0|2) 
in  steady  state  relative  to  the  unstirred  solution.  As  we  have  already  seen  from  Figure  2, 
inefficient  flow  not  only  exist  for  the  geophysical  source  on  the  sphere-they  are,  in  fact, 
fairly  common.  What  are  the  properties  of  these  inefficient  flows  that  lead  to  an  increase 
in  the  variance  of  9  relative  to  the  unstirred  solution? 

If  we  compare  the  most  efficient  sub-optimal  flow  of  Figure  2  to  the  most  inefficient  flow, 
we  find  that  the  concentration  of  the  tracer  field  in  steady  state  between  Panel  3(a)  and 
Panel  3(b)  are  quite  different.  The  efficient  flow  has  stagnation  points  centered  near  the 
mean  value  of  the  source  function  (located  in  the  midlatitudes  for  the  geophysical  source) 
and  strong  zonal  velocities  near  the  source  maxima  and  minima.  In  steady  state,  the 
warm  patches  of  tracer  are  advected  off  equator  in  thin  jets  while  cold  regions  are  advected 
equatorwards,  leading  to  efficient  mixing  of  the  tracer  field  relative  to  the  diffusive-only 
solution. 

The  inefficient  flow  in  Panel  3(b)  has  stagnation  points  centered  on  the  equator  at 
the  region  of  maximum  tracer  input  and  strong  flows  parallel  to  lines  of  constant  9.  The 
maximum  ‘heating’  at  the  equator  is  trapped  at  the  stagnation  points  of  the  flow  while  the 
meridional  flow  everywhere  is  relatively  weak.  Without  the  ability  to  remove  excess  tracer 
at  the  equator  and  transport  it  to  the  sink  at  the  poles,  this  flow  tends  to  increase  the 
tracer  variance  in  steady  state  relative  to  the  diffusive-only  solution  and  thus  has  a  mixing 
efficiency  of  £  <  1. 

As  for  inefficient  flows  on  the  torus,  Shaw  et  al.  [5]  identify  that  £  satisfies 

c2 (N2)  ^  4?r2  ([A-^l2)  _  Efc(Lfc/27r)~4|gfc|2  f9f<) 

"  (m  -  L2  <|V-iS|2>  £*,  (Lk/2ir)~2  |S~fc|2  1  j 
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which  they  then  solve  using  a  variational  problem  in  6  to  maximize  the  stirred,  steady  state 
(l^j2)  subject  to  the  steady  state  constraint  ^OV#)2)  =  (sO). 

The  results  of  Shaw  et  al.  [5]  for  a  monochromatic  source  are  due  in  part  to  the  similarity 
between  the  the  gradient  and  Laplacian  operators  in  Fourier  space,  evident  in  the  similarity 
in  the  numerator  and  denominator  of  the  last  term  in  (28).  For  a  monochromatic  source  of 
wavenumber  ks,  the  Laplacian  and  gradient  operators  are  simply 

V  =  iks  A  =  -k2s  (29) 

so  that  the  Laplacian  operator  is  the  square  of  the  gradient  operator. 

In  terms  of  spherical  harmonics,  however,  the  gradient  operator  is  significantly  different 
from  the  Laplacian.  The  Laplacian  of  the  spherical  harmonic  function  Y™  is  given  by 

A1T(A,m)  =  +  (30) 

az 

While  the  gradient  operator  is  similar  to  that  of  the  torus  in  the  zonal  direction, 

Lyr(A,rt  =  imr„”>,  (31) 

the  meridional  derivative  of  A,  fi)  is  a  recursion  relation 

(1  -  M2)  n)  =  -nvn+i,mY£l i(A,  n)  +  (n  +  l)vn,mY™_ X(A,  n)  (32) 

/  2  2  \  1/2 

where  un,m  =  (  )  •  On  the  sphere,  the  Laplacian  operator  of  a  monochromatic 

source  function  is  not  equal  to  the  square  of  the  gradient  operator.  The  simplification  of 
the  efficiency  given  in  (28)  from  Shaw  et  al.  [5]  will  not  apply  here.  It  will  thus  be  necessary 
to  find  an  alternative  approach  to  solving  for  the  analytic  lower  bound  on  £  in  spherical 
coordinates. 

4  Upper  Bounds  and  Peclet  Number  Scaling 

The  upper  bound  given  in  (27)  from  Shaw  et  al.  [5]  suggests  that  the  efficiencies  on  the 
torus  scale  linearly  in  the  Peclet  number  for  large  Pe.  Thiffeault  [6]  also  notes  that  for 
optimization  problems  on  the  torus,  the  optimal  efficiencies  scale  quadratically  for  small 
Pe.  Do  these  scalings  in  Peclet  number  hold  for  the  sphere? 

In  order  to  determine  the  efficiencies  of  optimal  flows,  we  must  first  solve  the  optimiza¬ 
tion  problem  given  in  (18).  We  can  use  the  highly  efficient,  sub-optimal  flows  from  Figure  2, 
namely  the  flows  with  ip  oc  T32(A,  /i)  and  ip  oc  Y2(X,  /i)  as  initial  guesses  at  Pe  =  10  to  obtain 
the  two  optimized  flows  shown  in  Figure  4. 

Both  flows  in  Figure  4  have  characteristics  similar  to  the  efficient,  but  sub-optimal  flows 
of  Figure  2.  The  optimal  flows  tend  to  have  stagnation  points  near  the  mean  value  of  the 
source  function  and  both  have  inter-henrispheric  cells  to  transport  the  tracer  from  high 
concentrations  at  the  equator  to  low  concentrations  near  the  poles.  The  efficiency  of  the 
optimized  flow  with  m  =  2  (Panel  4(a))  is  slightly  higher  than  that  of  the  optimized  flow 
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Eff  =  3.2953 

SPHERE:  onemodeNOEN  ex  =  2 


Eff  =  3.1927 

SPHERE:  onemodeNOEN  ex  =  1 


(a)  Optimized  Flow,  8  =  3.30 


Max  Streamf unction:  1 .55  Contour  Interval:  0.3 

Converged:  Optimized 


(b)  Optimized  Flow,  8  =  3.19 
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Figure  4:  Two  optimized  flows  with  initial  guesses  guided  by  the  efficient,  sub-optimal  flows 
Y22  and  Y32  shown  in  Figure  2.  Contours  of  the  streamfunction  are  superimposed  on  filled, 
colored  contours  of  the  steady  state  tracer  concentration. 


Figure  5:  Efficiencies  £  —  1  of  optimized  flows  (blue  circles)  and  sub-optimal  flows  (black 
lines)  as  a  function  changing  Peclet  number.  The  efficiencies  of  sub-optimal  flows  are  those 
arising  from  streamfunctions  proportional  to  (dashed)  and  (solid).  Red  dashed  lines 
are  the  predicted  scaling  for  the  most  efficient  flows  at  low  Pe  (quadratic  scaling)  and  at 
high  Pe  (linear  scaling). 
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Max  Streamf unction:  0.00  Contour  Interval:  0.0 


Figure  6:  The  cellular  source  on  the  sphere  mapped  in  latitude-longitude  coordinates. 

with  m  =  1  (Panel  4(b))  at  £  =  3.30  as  compared  to  £  =  3.19.  Expectedly,  both  flows  have 
efficiencies  greater  than  the  most  efficient  sub-optimal  flow  in  Figure  2. 

While  we  were  able  to  obtain  these  two  optimal  flows  at  Pe  =  10,  it  was  not  without 
difficulty.  As  state  in  Section  2.3,  the  optimization  algorithm  for  the  sphere  is  an  iterative 
process  that  will  converge  on  local  minima,  if  it  converges  at  all.  Unfortunately,  due  to 
time  limitations,  very  few  optimal  solutions  were  found  while  for  varying  Pe.  Those  that 
we  could  identify  are  plotted  in  Figure  5  along  with  the  efficiencies  of  sub-optinral  flows. 
With  the  exception  of  one  point  at  Pe  =  10,  most  all  optimized  solutions  lie  very  near  that 
of  the  initial  guess  (the  Yj  mode). 

In  place  of  looking  at  the  optimized  flows,  we  examine  the  scaling  of  the  efficiencies  of 
the  sub-optinral  ‘best  guess’  streamfunctions  with  monochornratic  or  Pg2  modal  structure 
instead  of  the  true  optimal  solutions.  From  Figure  5,  we  see  that  the  maximum  efficiencies 
of  the  Y-2  and  U2  modes  appear  to  scale  in  agreement  with  the  observations  of  Thiffeault 
[6]  and  of  Shaw  et  al.  [5]  for  the  torus,  namely  that  the  mixing  efficiencies  scale  as  Pe  for 
small  Pe  and  transition  to  a  Pe 2  scaling  at  high  Pe. 

The  scaling  of  £  with  Pe  even  holds  if  we  move  away  from  the  geophysical  source.  If  we 
consider  a  cellular  source  on  the  sphere  (Figure  6)  and  its  equivalent  on  the  torus,  we  can 
compare  the  scaling  of  the  efficiencies  between  the  two  geometries  directly.  The  efficiencies 
of  sub-optimal  flows  on  the  sphere  (Panel  7(a))  and  both  optimal  and  sub-optinral  flows  on 
the  torus  (Panel  7(b))  are  plotted  against  Pe  in  Figure  7. 

There  are  two  primary  observations  to  note  in  Figure  7:  the  close  proximity  of  optimized 
flows  on  the  torus  to  the  sub-optimal  cellular  flows  and  the  agreement  with  Pe  scaling 
arguments.  For  optimized  flows  on  the  torus,  as  with  the  optimized  flows  on  the  sphere  in 
Figure  5,  the  optimal  solutions  lie  very  close  to  the  ‘best  guess’  sub-optinral  flows.  This 
suggests  that  the  optimized  efficiencies  on  the  sphere  for  the  cellular  source  should  be 
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Figure  7:  Scaling  of  efficiencies  <5  —  1  with  respect  to  changing  Pe  for  the  cellular  source 
function  (shown  in  Figure  6  for  the  sphere)  on  the  sphere  (Panel  7(a))  and  for  the  equivalent 
flow  on  the  torus  (Panel  7(b)).  Efficiencies  of  sub-optimal  flows  of  Yj1  (solid)  and  Y22 
(dashed)  over  the  surface  of  the  sphere  are  shown  in  Panel  7(a).  Similarly,  sub-optimal 
flows  on  the  torus  are  shown  for  kx  =  1,  ky  =  1  (solid)  and  kx  =  1,  ky  =  0  (dashed)  in 
Panel  7(b).  Optimized  flows  for  the  torus  are  shown  in  circles.  Quadratic  and  linear  Pe 
slopes  are  red-dashed. 

relatively  close  to  the  sub-optinral  flows  shown  in  Figure  7,  however  this  still  needs  to  be 
shown  in  future  work. 

The  close  agreement  to  the  Pe  scaling  between  the  torus  and  the  sphere  suggests  that, 
unlike  the  lower  bound  on  £,  one  property  of  maximally  efficient  ffows  from  the  torus  does 
carry  over  to  the  sphere.  Why,  then,  does  the  efficiency  scale  linearly  in  Pe  at  high  Pe  and 
as  Pe2  at  small  Pe ?  Why  is  this  result  independent  of  geometry? 

At  high  Pe,  we  can  approximate  the  effect  of  stirring  by  and  eddy  diffusivity,  neff,  that 
is  significantly  larger  than  the  molecular  diffusivity,  n.  For  an  advection-dominated  regime, 
an  effective  diffusivity  can  be  generated  by  Keff  ~  UL,  where  U  is  the  characteristic  length 
of  the  advective  velocity  and  L  is  a  characteristic  mixing  length  [4,  5].  For  the  cellular  flows 
we’ve  considered  here,  where  the  most  efficient  ffows  tend  to  be  those  of  the  largest  scale 
permissible,  the  characteristic  length  scale  is  that  of  the  domain  for  the  torus  or  that  of 
radius  for  the  sphere. 

The  stirred  steady  state  advection-diffusion  equation  (1)  is  then  parameterized  by 

KefjA8  ~  s  (33) 

while  the  unstirred,  diffusive  steady  state  problem  is  unchanged 

kA#o  =  s  (34) 

so  that  the  solutions  to  the  steady  state  stirred  and  unstirred  systems  are 

0&—  A"1*  0o=-A~1s.  (35) 

Heff  K 
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The  efficiency  £  is  then  approximated  by 


£ 


Keff(  1^  1 


(36) 


which  scales  as 


£ 


Keff 


UL 


K 


=  Pe 


(37) 


where  we  have  used  neff  ~  UL.  From  (37),  the  mixing  efficiency  £  should  scale  linearly 
with  Pe  so  long  as  Pe  is  sufficiently  large  such  that  the  mixing  is  advection-dominated  and 
the  advection-diffusion  equation  can  be  approximated  by  an  eddy  diffusivity. 

The  diffusion-dominated  small  Pe  regime,  on  the  other  hand,  appears  to  scale  quadrati- 
cally  in  Pe  for  the  cases  we  have  considered  here.  Consider  the  advection-diffusion  operator 
C,  given  in  (9)  in  the  limit  of  small  Pe.  By  definition  the  Peclet  number  is  the  scale  of 
the  advection  term  relative  to  the  diffusive  term  in  (1)  and  could  be  considered  a  small 
parameter  in  the  steady-state,  nondimensional  advection-diffusion  equation 


Peu  •  V  —  A  )  6  =  s 


(38) 


where  hatted  variables  are  nondimensional.  We  have  assumed  here  that,  to  a  first  approxi¬ 
mation,  the  size  of  the  source  function  scales  as  Ad  in  the  limit  of  small  Pe. 

The  inverse  of  the  nondimensional  operator  £_1  =  ^ Peu  •  V  —  A^j  can  then  be  ap¬ 
proximated  by  expanding  about  Pe  <C  1.  Let  A  =  u  ■  V  be  the  advection  operator  and  let 
D  =  —A  be  the  diffusion  operator.  Additionally,  let  e  =  Pe  be  the  small  parameter.  The 
nondimensional  advection-diffusion  operator  (dropping  the  hats)  is  then 


£  =  D  +  eA. 


(39) 


The  inverse  of  (39)  can  then  be  expanded  out  to  O(e) 

C~l  =  D~l  -  eD^AD-1  +  0(e2) 


such  that  (I#)2)  =  (| £  1s|2)  is 

(\9\2)  =  ^  f  0 C~ls)2dV 

=  ^  j  ( D~ls  -  eD~1AD~1s  +  0{e2)f  dV 

Expanding  the  square  of  (41)  yields 

( |0| 2)  =  y  J  (L>_1.s)2  -  2e  {D^.s)  (D-'AD-'s)  dV  +  0(e2) 

The  advection  operator  can  be  written  in  terms  of,  At;  =  [^;,  £],  where 

,  ,  _  dip  dip  dtp  dil’  _  Op  dp 
^  ’  dx  dy  dx  dy  dy  1  dx 


(40) 


(41) 


(42) 


(43) 
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on  the  torus,  noting  that  the  non-divergent  advective  velocities  are  defined  using  a  stream- 
function  ip  by  u  =  —  ^  and  v  =  On  the  sphere, 


[Vk  £]  = 


1  dip\  ( \ 


a  cos  (j>  d\  J  \a  dcp 


1  d£\  f  1  dip 


a  cos  (j)d\  J  \  a  dp 


for  the  non-divergent  velocities  given  in  (21). 

Replacing  the  advection  operator  in  (42)  with  [ip,£]  yields 

(\0\2)  =  y  j  (D-h)2  -2e{D~1S)  (D-'^D-hfidV +  ()(<?) 

such  that  the  variation  in  (|0|2)  given  a  change  in  the  streamfunction  Sip  is 

S{\0\2)  =  -y  J  (D-h)  D -1  [SiP.D-h]  dV  +  0(e2) 

=  -y  J  Sip  [D-'s,  D~2s]  dV  +  0(e2). 

To  0(e2),  the  variation  of  (|0|2)  is  then 


Sip 


^  =  — 2e(  [.D_1s,  D~2s\  )  +  0(e2). 


(44) 


(45) 


(46) 

(47) 


(48) 


For  any  source  that  satisfies  [D~1s,  D~2s^  =  0,  such  as  monochromatic  sources,  the 
variation  of  (|#|2)  with  respect  to  a  change  in  the  streamfunction  Sip  will  be  zero  at  0(e). 
As  both  the  geophysical  and  the  cellular  sources  considered  here  are  monochromatic,  we  can 
expect  (|#|2)  to  vary  as  e2  =  Pe2,  hence  the  quadratic  scaling  in  the  Peclet  number  at  small 
Pe.  If  we  were  to  instead  look  flows  which  optimized  £  on  the  sphere  with  a  source  function 
satisfying  D~~2s\  ^  0,  we  might  expect  linear  scaling  at  small  Pe.  Experiments  so 

far,  however,  have  been  inconclusive.  Finding  sources  where  efficiencies  scale  linearly  in  Pe 
is  left  for  future  work. 


5  Sweeping  Flow 

It  is  interesting  to  note  that  the  flow  appears  to  saturate  in  efficiency  around  Pe  =  102 
in  Figure  5  for  the  geophysical  source.  This  saturation  leads  to  a  crossing  of  the  efficiencies 
of  sub-optimal  flows.  Although  streamfunctions  with  a  modal  structure  of  Y^2  are  more 
efficient  mixers  than  those  of  at  small  Pe,  l^1  is  more  efficient  than  the  Y2  mode  at  high 
Pe. 

The  saturation  of  £  at  high  Pe  for  the  Ig2  streamfunction  can  be  understood  by  looking 
at  the  steady  state  snapshots  of  9  and  ip  in  Figure  8.  At  Pe  =  103,  the  tracer  field  becomes 
trapped  near  the  stagnation  points  of  u.  For  the  saturated  streamfunction  in  Panel  8(b), 
the  flow  velocity  is  largely  parallel  to  contours  of  constant  8  in  steady  state  such  that  the 
velocity  field  is  no  longer  stirring  the  tracer  field  as,  for  these  flows,  u  ■  X79  ~  0.  As  the 
advective  term  of  (7)  is  small,  increasing  the  Peclet  number  further  by  either  decreasing  the 
diffusivity  or  increasing  the  strength  of  the  velocity  field  u  will  do  little  to  change  the  mixing 
efficiency,  hence  £  will  not  change  greatly  with  increasing  Pe  as  observed  in  Figure  5. 
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Elf  =  25.628 

SPHERE:  highPE3221  ex=  1 


(a)  High  Pe ,  £  =  104.06,  Y2] 


(b)  High  Pe,  £  =  25.63,  Y32 


Figure  8:  Flows  given  with  a  streamfunction  structure  of  in  Panel  8(a)  and  Y 32  in  Panel 
8(b)  at  Pe  =  103  in  the  region  of  efficiency  saturation.  Streamlines  (black)  are  superimposed 
on  colored  contours  of  the  stirred,  steady-state  9. 


We  are  interested  in  finding  the  velocity  field  that  most  efficiently  mixes  the  a  tracer 
over  the  surface  of  the  sphere  for  a  given  source  function,  however  we  have  only  considered 
time  independent  velocity  fields  that  contain  stagnation  points  in  u.  The  presence  of  these 
stagnation  points  limits  the  mixing  efficiency  of  these  steady  ffows  at  high  Pe.  We  may  be 
able  to  achieve  higher  £  if  we  allow  the  flow  to  vary  in  time  such  as  to  effectively  remove 
these  stagnation  points  on  average. 

One  example  of  such  an  efficient,  time- varying  ffow  for  the  torus  is  found  in  Shaw  et  al. 
[5].  Shaw  et  al.  [5]  are  able  to  saturate  the  upper  bound  of  £  given  in  (27)  using  a  ‘sweeping’ 
flow.  This  flow  is  characterized  by  a  constant  velocity,  U,  that  alternates  between  a  purely 
zonal  flow  and  a  purely  meridional  flow  over  the  torus  over  long  timescales.  Does  such  a 
sweeping  flow  exist  for  the  sphere? 

A  sweeping  flow  as  described  in  Shaw  et  al.  [5]  for  the  torus  cannot  be  defined  on  the 
sphere  in  the  same  manner  as  it  is  on  the  torus  due  to  the  presence  of  the  poles.  The 
velocity  field  u  must  be  zero  at  the  poles  because  of  artificial  discontinues  in  the  gradient 
operator  in  spherical  coordinates.  If  we  were  to  consider  a  velocity  field  that  was  purely 
zonal  at  one  instant  in  time  it  would  be  require  to  have  the  amplitude  of  u  =  u(4>)\  to  go 
to  zero  as  0  — >  ±90°.  A  rotation  of  this  flow  by  90°  on  the  sphere  would  not  produce  the 
same  purely  meridional  flow  as  in  the  sweeping  flow  on  the  torus.  Instead,  it  would  produce 
a  flow  with  strong  meridional  velocities  over  the  poles  and  with  two  stagnation  points  at 
the  equator.  With  a  fixed  coordinate  system  this  rotated  ffow  violates  the  condition  that 
\u\  =  0  at  the  poles. 

A  sweeping  flow  on  the  sphere  would  then  require  more  than  just  a  rotation  of  the 
velocity  field.  One  method  to  obtain  a  sweeping  flow  on  the  sphere  is  described  by  Finn  [2]. 
Finn  [2]  considers  a  sweeping  flow  produced  by  a  rotation  of  both  the  velocity  field  and  the 
source  function  or,  equivalently,  a  rotation  of  the  coordinate  system. 
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Perhaps  the  simplest  sweeping  flow  over  the  sphere  would  have  an  initial  velocity  field 


u  = 


U  cos{4>)\ 


mixing  a  tracer  supplied  by  the  source  function  given  by 


s(X,n)  =  SPi(n)  (exp(-iA)  +  exp(iA))  =  S  +  Yx  1(X ,//))  . 


(49) 


(50) 


This  source  function  is  not  identical  to  the  rotate  geophysical  source.  It  instead  describes  a 
sphere  heated  in  the  western  hemisphere  and  cooled  in  the  eastern  hemisphere.  We  choose 
this  particular  source  instead  of  the  rotated  geophysical  source  in  Finn  [2]  as  simplifies  the 
advective  term  significantly  upon  rotation  of  the  velocity  field. 

We  allow  the  velocity  field  given  in  (49)  to  mix  the  source  from  (50)  to  steady  state 
before  rotating  the  velocity  field  by  90°.  Upon  rotation,  the  velocity  field  will  lie  parallel  to 
lines  of  constant  (j)  as  supplied  by  the  source  function  s(A,  /j)  from  (50)  such  that  u-V6  =  0. 
As  the  velocity  field  does  not  advect  the  tracer  away  from  the  source,  all  stirring  enhanced 
mixing  occurs  in  the  first  half  of  the  sweeping  flow  when  u  is  given  by  (49). 

The  mixing  efficiency  of  the  sweeping  flow  given  by  (49)  over  the  sphere  can  be  calculated 
by  first  finding  the  steady  state  tracer  concentration  satisfying  the  steady-state  advective 
diffusive  equation  of  (7).  Using  the  gradient  and  Laplacian  formulations  given  in  (31)  and 
(30),  the  tracer  concentration  in  steady  state  satisfies 


E  E 

\n= 0  m=—n 


im  CC(A,|.) 


+*E  E 

n= 0 m=—n 

=  S  (^(A./i)  +  Yi-1(A,/i)) 


such  that  the  we  can  solve  for  the  9™  by  matching  coefficients, 


By  Parsevals’  theorem,  the  variance  of  0  is  then 

oo  n 

{|9|2>  =  2,^  XI  |C 

n=0  m=—n 


(51) 


(52) 


(53) 


which,  given  the  coefficients  from  (52),  allows  us  to  find  the  steady  state  tracer  variance  of 
the  stirred  problem 


(|0|2)  =  4v tS2 


3  U2\ 
+  2at) 


(54) 
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By  taking  U  =  0  in  (54),  we  can  similarly  find  the  steady  state  tracer  variance  of  the 
unstirred  problem 


<|0or>  =  4v rS"  4- 


2  \  -1 


(55) 


we  can  calculate  the  efficiency  of  the  sweeping  flow  as 


f2  = 


=  ^[^  +  ~)  =l  +  ^Pe 


4tc2 


2  a2 


(56) 


where  Pe  =  — . 

If  we  were  to  compare  the  mixing  efficiency  of  the  sweeping  flow  over  the  sphere  to  the 
upper  bound  for  the  mixing  efficiency  of  a  monochromatic  source  similar  to  that  of  (50) 
on  the  torus  in  (27),  we  would  expect  that  the  most  efficient  sweeping  flow  over  the  torus 
would  have  a  mixing  efficiency  of 

£  torus  =  1  +  (57) 

where  the  source  function  on  the  d  =  2  torus  is  described  by 

s(x,  y )  =  St0rus{elx  +  e~lx )  (58) 

in  a  domain  of  length  L  =  1  with  a  total  wavenumber  of  k2  =  1. 

In  comparing  (56)  to  (57),  we  note  that  the  mixing  efficiency  of  the  sweeping  flow  on 
the  sphere  is  very  close  to  that  of  the  torus.  The  effiency  £ 2  of  the  sweeping  flow  on 
the  sphere  scales  as  | Pe2  as  compared  to  a  scaling  of  \ Pe 2  on  the  torus.  For  the  same 
Peclet  number,  the  sweeping  flow  over  the  torus  will  be  slightly  more  efficient  than  the 
corresponding  sweeping  flow  on  the  sphere. 

Additionally,  the  sweeping  flow  over  the  sphere  has  an  efficiency  £  =  \J  1  +  |Pe2  which 
scales  as 


£  « 


\J\Pe  for  Pe^>  1 
1  +  TaPe2  for  Pe  <C  1 


(59) 


such  that  £  scales  linearly  in  Pe  at  high  Pe  and  quadratically  in  Pe  at  low  Pe  in  agreement 
with  the  arguments  for  highly  efficient  flows  found  in  Section  4. 

While  Shaw  et  al.  [5]  found  that  the  sweeping  flow  on  the  torus  is  the  most  efficient 
flow  for  a  monochromatic  source  function,  we  are  not  guaranteed  that  the  sweeping  flow 
described  by  (49)  is  the  most  efficient  flow  for  the  the  sphere.  One  particular  issue  with 
any  flow  on  the  sphere  arises  due  to  the  location  of  stagnation  points  in  the  flow. 

As  we  must  have  |u|  =  0  at  the  poles  for  any  rotated  coordinate  system  we  choose,  we 
will  always  have  stagnation  points  in  the  velocity  field  over  the  poles.  It  would  then  always 
be  possible  to  achieve  the  trapping  behavior  described  for  inefficient  flows  in  Section  3  if 
these  polar  stagnation  points  are  located  away  from  the  mean  value  of  the  source  function. 
(One  could  consider  regions  where  the  source  is  equal  to  its  mean  as  already  well-mixed, 
thus  stirring  by  u  could  do  little  to  reduce  the  variance  in  these  regions.)  This  is  one  reason 
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why  we  chose  the  source  given  in  (50)  rather  than  the  rotated  geophysical  source.  The 
rotate  geophysical  source  from  Finn  [2],  given  by 

s( A,  At)  =  --^-SPziv)  (exP  (— 2*A)  +  exp  (2iA))  +  -^=ST2°  (m)  *  (60) 

would  have  a  maxima  over  the  poles.  These  regions  could  never  be  mixed  for  a  fixed  spherical 
coordinate  system,  leading  to  a  saturation  of  £  with  increasing  Pe  similar  to  the  suboptimal 
flows  stirred  by  a  streamfunction  proportional  to  in  Figure  5  and  in  Panel  8(b). 

Finn  [2]  suggests  a  more  efficient  sweeping  flow  could  be  achieved  by  a  careful  rotation  of 
the  coordinate  system  at  each  time  interval.  If  the  poles  of  the  spherical  coordinate  system 
are  always  located  in  regions  where  the  source  function  achieves  its  mean  value  one  could 
bypass  the  trapping  issue  on  the  sphere  and  possibly  find  the  most  efficient,  time-dependent 
flows  on  the  sphere. 

6  Discussion  and  Future  Directions 

In  this  work  we  sought  answers  to  the  two  following  questions:  1)  what  flows  most  efficiently 
mix  a  tracer  field  on  the  surface  of  the  sphere  that  is  supplied  by  a  spatially-varying  source 
function?  and  2)  how  do  the  properties  of  these  flows  compare  to  similar  velocity  fields  on 
the  torus?  We  were  able  to  find  flows  that  maximized  the  mixing  efficiency  given  in  (5) 
using  a  numerical  optimization  scheme.  This  optimization  scheme,  however,  only  optimizes 
the  velocity  field  according  to  one  measure  of  the  mixing  efficiency.  In  determining  the  flows 
that  maximize  other  possible  measures  of  mixing  efficiency,  such  as  the  multi-scale  mixing 
efficiencies  described  in  Shaw  et  al.  [5]  given  by 

c-2  (IV^OI2) 
p  (|VP0|2) 

where  p  =  —1,0,1,  we  could  emphasize  the  homogenization  of  variance  of  6  on  smaller 
(p  =  1)  and  larger  (p  =  —1)  spatial  scales. 

In  changing  geometries  we  find  that  many  of  the  properties  of  optimized  flows  over  the 
torus  do  not  translate  directly  to  that  of  the  sphere.  The  hard  lower  bound  of  1  <  £  of 
Shaw  et  al.  [5] ,  for  example,  is  violated  for  the  geophysical  source  on  the  sphere  due  to  the 
inescapable  presence  of  stagnation  points  in  u  and  the  trapping  of  the  tracer  concentration 
at  its  source. 

The  issue  of  stagnation  points  in  the  stirring  field  on  the  sphere  also  has  consequences 
for  saturating  an  upper  bound,  should  such  an  upper  bound  be  discovered  in  the  future.  To 
remove  these  stagnation  points  we  must  consider  time-dependent  velocity  fields  similar  to 
the  sweeping  flow  described  by  Shaw  et  al.  [5]  modified  by  the  solid  body  rotation  method 
introduced  by  Finn  [2].  It  may  be  possible  to  determine  the  most  efficient  time-dependent 
flow  for  a  given  source  function  on  the  sphere  by  using  the  method  of  steepest  decent 
described  in  Lin  et  al.  [3]. 

One  similarity  between  efficiencies  of  optimal  (or  sub-optimal,  in  the  case  of  the  sphere) 
flows  for  the  two  geometries  is  that  of  the  scaling  of  £  with  changing  Pe.  For  both  the 
geophysical  and  the  cellular  source  functions,  the  efficiency  £  scales  linearly  with  Pe  at  high 
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Pe  and  quadratically  with  Pe  at  small  Pe.  For  diffusion-dominated  flows  at  small  Peclet 
number  we  expanded  the  advection-diffusion  integral  operator,  C~l  around  the  diffusive 
term  and  determined  that  the  £  ~  Pe 2  scaling  in  this  regime  was  a  consequence  of  our 
choice  of  monochromatic  sources.  It  is  an  open  question  as  to  whether  we  can  obtain  linear 
Peclet  number  scaling  for  non-monochromatic  sources,  such  as  sources  with  off-equatorial 
heating. 

As  we  are  motivated  here  by  the  geophysical  question  of  the  redistribution  of  heat  over 
the  surface  of  the  sphere,  the  optimization  of  £  subject  to  more  complex  source  functions 
are  of  significant  interest.  Some  source  functions  in  particular  are  those  with  off-equatorial 
heating,  as  mentioned,  to  simulate  seasonal  changes  in  solar  insolation.  Additional  source 
functions  we  could  consider  include  those  with  longitudinally  localized  heating  to  simulate 
land-sea  surface  temperature  contrasts. 

Throughout  this  work  we  have  considered  a  velocity  field  that  is  independent  of  tem¬ 
perature,  however  in  a  geophysical  context  these  two  fields  are  not  wholly  independent.  It 
may  be  of  further  interest  to  modify  the  code  to  allow  for  coupling  between  the  velocity 
field  and  the  tracer  field. 

The  energy  constraint  here  follows  from  previous  work  on  the  torus.  However,  this  is 
not  the  only  dynamical  constraint  we  may  wish  to  explore  on  the  sphere.  If  we  were  to 
include  some  form  of  rotation  to  our  model  of  optimized  flows  over  the  sphere,  we  could 
additionally  consider  an  angular  momentum  constraint  in  the  form  of 

M  =  £la2  cos2  <j)  +  ua  cos  4>  (62) 

where  M  is  the  angular  momentum,  II  is  the  rotation  rate,  a  is  the  radius  of  the  sphere, 
and  u  is  the  zonal  component  of  the  non-divergent  velocity  field  u. 

On  the  Earth,  the  redistribution  of  heat  on  the  sphere  from  large-scale  atmospheric 
stirring  is  not  primarily  achieved  by  the  barotropic  circulations  we  have  considered  here, 
but  instead  by  the  near-zonally  symmetric  circulations  of  the  Hadley  and  Ferrel  cells  in 
height  2  and  latitude  4>  coordinates.  A  question  of  significant  geophysical  interest  would 
then  be  what  zonally  symmetric  circulation  pattern  in  z-cj)  coordinates  most  efficiently  mixes 
temperature  on  the  sphere?  Furthermore,  can  such  a  flow  be  obtained  using  a  numerical 
optimization  scheme  such  as  the  one  used  here? 
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Snail  locomotion 
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1  Introduction 

A  snail  propels  itself  by  deforming  a  layer  of  mucus  beneath  it  via  muscular  contractions 
in  its  foot  muscle.  This  unique  form  of  locomotion  allows  snails  to  traverse  almost  any 
obstacle,  including  the  ability  to  climb  steep  inclines.  Throughout  the  snail  kingdom,  there 
exist  a  number  of  variations  for  the  pattern  of  muscular  forcing  used  by  the  foot  to  activate 
locomotion  and  in  particular,  the  two  main  classes  of  snails,  namely  land  snails  and  water 
snails,  have  adopted  two  very  different  patterns.  Water  snails  prefer  a  method  in  which 
its  foot  applies  a  predominantly  vertical  force  upon  the  mucus  layer  as  a  wave  travelling  in 
the  opposite  direction  to  that  of  locomotion.  This  retrograde  locomotion  generates  forces 
in  a  similar  manner  to  that  of  a  peristaltic  pump,  whereby  the  forcing  surface  undergoes 
significant  perpendicular  deformation.  Land  snails  prefer  a  method  that  applies  a  wave  of 
tangential  muscular  contractions  that  travel  in  same  direction  to  that  of  locomotion.  Unlike 
the  retrograde  case,  this  prograde  locomotion  involves  significant  tangential  contraction  of 
the  base  of  the  foot  muscle,  and  little  vertical  deformation.  These  processes  involve  a 
rich  mixture  between  the  fluid  mechanics  of  the  mucus  layer  and  its  interaction  with  the 
muscular  and  elastic  mechanics  of  the  snail’s  foot. 

A  further  complication  is  introduced  by  the  non-Newtonian  rheology  of  snail  mucus, 
which  is  known  to  have  an  appreciable  yield  stress,  i.e,  a  minimum  stress  necessary  to 
deform  it.  Evolution  has  lead  to  this  rheology  because  it  allows  the  snail  to  remain  adhered 
to  a  steep  incline  without  having  to  apply  any  force  to  counteract  its  weight.  By  selectively 
forcing  the  mucus  in  a  wave-like  pattern,  the  snail  is  able  to  climb  the  incline  without 
slipping.  In  this  report  we  investigate  the  two  forms  of  snail  locomotion  outlined  above, 
and  further  attempt  to  quantify  the  role  of  yield  stress  in  locomotion. 

Studies  of  locomotion  in  fluids  began  with  the  seminal  work  of  Taylor  [4] ,  who  considered 
the  low- Reynolds  number  problem  of  a  two-dimensional  surface  surrounded  by  a  Newtonian 
fluid,  that  swims  by  exerting  a  normal  force  as  a  wave  travelling  in  the  opposite  direction 
to  that  of  locomotion.  By  performing  a  biharmonic  analysis  of  the  flow  in  the  limit  of 
a  low-amplitude  forcing,  he  relates  the  velocity  of  locomotion  to  the  parameters  of  the 
forcing.  This  problem  was  generalized  by  Katz  [3]  to  include  a  rigid  wall  lying  parallel  to 
the  swimmer,  which  he  investigates  using  both  the  biharmonic  approach  of  Taylor,  and  a 
lubrication  theory  for  the  thin-gap  limit.  More  recently,  Balmforth,  Coombs  and  Pachman 
[1]  extend  this  lubrication  theory  to  cases  involving  non-Newtonian  fluids  in  the  thin-gap 
geometry.  This  study,  which  considers  the  effect  of  a  yield  stress,  is  of  relevance  to  the 
process  of  retrograde  snail  locomotion.  A  model  of  prograde  snail  locomotion  is  investigated 
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main  body  Uy-) 


y  =  d  +  H 


Figure  1:  Schematic  of  the  foot  and  mucus  layer  in  retrograde  locomotion. 


by  Chan,  Balmforth  and  Hosoi  [2] ,  in  which  they  model  the  foot  by  prescribing  its  tangential 
displacement. 

In  this  project,  we  begin  in  section  2  with  an  investigation  into  the  process  of  retrograde 
snail  locomotion  that  extends  the  work  of  Balmforth  et  al.  [1]  by  solving  the  system  as  an 
initial- value  problem,  and  further  explores  the  effect  of  the  elastic  parameters  of  the  foot  on 
the  locomotion  velocity.  We  show  that  the  system  converges  toward  a  steady  state,  which 
we  proceed  to  study  by  solving  the  associated  eigenvalue  problem.  From  our  solutions,  we 
identify  a  critical  yield  stress  above  which  locomotion  does  not  occur,  and  further  relate 
the  locomotion  velocity  to  the  parameters  of  the  system.  In  section  3,  we  consider  the  case 
of  prograde  snail  locomotion  in  which  we  extend  the  work  of  Chan  et  al.  [2]  by  developing 
a  fully  dynamic  model  with  both  a  muscular  forcing  and  a  consideration  of  the  elastic 
mechanics  of  the  snail’s  foot.  We  proceed  to  study  the  resulting  system  numerically  and 
derive  some  analytical  results. 

2  Retrograde  locomotion 

2.1  The  foot 

We  assume  a  configuration  in  which  a  layer  of  mucus  of  average  height  H,  is  confined 
between  the  foot’s  surface  at  y  =  Y(x,t),  and  a  stationary  rigid  wall  at  y  =  0,  as  depicted 
schematically  in  figure  1.  The  foot  is  assumed  to  be  composed  of  a  linear  elastic  material, 
of  average  thickness  d,  that  lies  in  contact  with  the  main  body  of  the  snail  at  y  =  H  +  d, 
which  we  assume  to  be  a  rigid  solid  that  is  able  to  translate  horizontally  with  velocity  U(t). 
Further,  we  allow  the  foot  to  subject  itself  to  forces  via  internal  muscular  contractions,  which 
lead  to  elastic  deformation  and  hence  provide  a  means  to  activate  its  motion.  Analysis  of 
snail  anatomy  has  shown  that  these  contractions  are  produced  by  a  series  of  muscular  fibres 
inclined  at  varying  angles  that  lie  anchored  just  inside  the  main  body  of  the  snail  and  the 
base  of  the  foot.  By  choosing  the  contraction  pattern  of  these  inclined  fibres  appropriately, 
the  snail  is  able  to  force  different  points  of  its  foot’s  surface  in  a  variety  of  directions  and 
patterns. 
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In  retrograde  locomotion,  the  contraction  pattern  is  such  that  the  resultant  muscular 
forces  act  predominantly  normal  to  the  interface  between  the  mucus  layer  and  the  foot,  as 
shown  in  figure  1.  Anatomical  studies  of  snails  have  shown  that  the  top  surface  of  the  foot 
does  not  deform  significantly  into  the  main  body  of  the  snail.  Thus,  we  assume  that  the 
main  body  of  the  snail  acts  as  a  rigid  foundation,  such  that  the  top  surface  of  the  foot  does 
not  penetrate  the  main  body  above.  Assuming  the  foot  is  very  slender  where  L  is  a 

characteristic  length  scale  for  its  horizontal  variation,  the  leading-order  pressure  in  the  foot 
is  given  by 

p  =  Af(x,i)  +  DR(Y-H),  (1) 

where  Af  represents  the  muscular  forcing,  A  is  the  amplitude  of  the  forcing,  and  Dr  is 
the  vertical  stiffness  coefficient  that  can  be  expressed  in  terms  of  the  elastic  parameters  of 
the  foot.  The  elastic  force,  represented  by  the  second  term  on  the  right-hand  side  of  this 
equation,  provides  a  restoring  force  that  is  proportional  to  the  perturbation  of  the  thickness 
profile  from  H.  Thus,  we  can  consider  the  surface  of  the  foot  to  be  equivalent  to  a  sheet 
attached  to  the  main  body  of  the  snail  via  Hookean  springs  of  equilibrium  height  d,  as 
illustrated  in  figure  1,  where  Dr  represents  the  Young’s  modulus  of  the  springs.  Based  on 
observations  of  wave-like  patterns  of  muscular  contraction  that  propagate  along  the  feet  of 
water  snails,  we  assume  a  forcing  of  the  form 

/  =  /(**),  (2) 

where  £  =  x  +  ct.  For  positive  wave  speed  c,  equation  (2)  corresponds  to  a  wave  travelling 
in  the  negative  x-direction,  chosen  in  anticipation  that  this  will  lead  to  steady  locomotion 
in  the  positive  x-direction. 

The  absence  of  significant  tangential  forcing  implies  that  we  can  assume  that  the  foot 
undergoes  negligible  horizontal  deformation  to  leading  order.  In  this  case,  the  horizontal 
velocity  of  the  entire  foot  is  equal  to  that  of  the  main  body  of  the  snail  U,  so  that  the 
velocity  of  the  foot’s  surface  is  given  uniformly  by 

us  =  U(t).  (3) 

Having  established  the  fundamentals  of  the  foot  mechanics  in  retrograde  locomotion,  we 
proceed  to  consider  the  fluid  mechanics  of  the  mucus  layer,  which  we  subsequently  relate 
to  the  elastic  mechanics  of  the  foot  via  boundary  conditions  imposed  at  the  foot-mucus 
interface. 

2.2  The  mucus  layer 

We  assume  that  the  mucus  layer  is  composed  of  an  incompressible  yield-stress  fluid  that  is 
subject  to  forcing  from  the  foot  above.  We  proceed  to  detail  a  model  for  the  yield-stress 
rheology  of  the  fluid,  and  then  simplify  the  theoretical  analysis  of  the  resulting  equations  by 
applying  a  lubrication  approximation.  The  general  equations  of  force  balance  and  continuity 
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\j\ 


Figure  2:  Relationship  between  the  moduli  of  the  deviatoric  stress  and  rate-of-strain  tensors 
for  a  Newtonian  (a)  and  Bingham  (b)  fluid. 


for  an  incompressible  fluid  of  velocity  u  =  (u,v),  are  given  by 

V  •  f  =  Vp,  (4) 

V  •  u  =  0,  (5) 

where  f  is  the  deviatoric  stress  tensor,  and  p  is  the  pressure.  A  constitutive  relation  that 
relates  the  deviatoric  stress  to  the  deformation  of  the  fluid  is  necessary  to  close  the  system. 
The  simplest  model  of  a  yield-stress  fluid  is  the  Bingham  model,  in  which  one  applies  the 
constitutive  relation 

r  f  =  Ki-  +  f,'7/|7|  (jtj>7v).  {6, %b) 

l  7  =  0  (|r|  <  ry), 

where  7  =  Vu  +  (Vit)r  is  the  rate-of-strain  tensor,  I7I  =  -y/7  :  7  and  |r|  =  V f  :  f  are 
the  moduli  of  the  rate-of-strain  and  deviatoric  stress  tensors  respectively,  the  constant  ry 
is  the  yield  stress,  and  /1  is  the  coefficient  of  viscosity.  Relation  (6b)  ensures  that  the  fluid 
does  not  deform  if  the  magnitude  of  the  deviatoric  stress  at  a  point  does  not  exceed  the 
yield  stress.  If  the  magnitude  of  the  deviatoric  stress  does  exceed  the  yield  stress,  then 
relation  (6a)  implies  that  the  magnitude  of  the  deviatoric  stress  in  excess  of  the  yield  stress 
is  linearly  proportional  to  the  deformation  of  the  fluid  7.  We  sketch  this  relationship  in 
figure  2,  in  which  we  compare  it  to  the  Newtonian  subcase  ry  =  0. 

Part  of  the  integration  of  the  boundary-value  problem  defined  by  equations  (4)-(6) 
involves  identifying  the  locations  of  yield  surfaces  where  the  magnitude  of  the  deviatoric 
stress  tensor  attains  the  yield  stress.  These  surfaces  provide  boundaries  for  rigid  regions  of 
unyielded  fluid,  called  rigid  plugs,  in  which  the  magnitude  of  the  deviatoric  stress  tensor 
does  not  exceed  the  yield  stress.  Apart  from  along  the  yield  surfaces,  the  stress  inside 
these  rigid  plugs  does  not  form  part  of  the  solution  described  by  the  Bingham  model.  The 
presence  of  the  non-linear  yield-stress  term  in  equation  (6a),  and  the  need  to  identify  yield 
surfaces,  have  the  potential  to  make  problems  involving  Bingham  fluids  especially  difficult 
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to  investigate  theoretically.  However,  given  that  the  thickness  of  the  mucus  layer  is  much 
less  than  the  characteristic  length  scale  L  of  its  horizontal  variation,  we  can  simplify  the 
equations  of  motion  substantially  by  making  a  lubrication  approximation  H  <C  L.  Under 
this  assumption,  a  scaling  analysis  of  the  terms  in  equation  (4)  implies  that  the  vertical 
shear  stress  f  =  fxy  is  much  greater  than  the  other  components  of  the  deviatoric  stress 
tensor.  Furthermore,  the  scaling  between  horizontal  and  vertical  velocity  gradients  given 
by  the  continuity  equation  (5)  implies  that  the  flow  is  predominantly  horizontal.  Thus,  to 
leading  order  in  the  lubrication  approximation,  the  horizontal  and  vertical  components  of 
equation  (4)  simplify  to 


Ty=Px,  (7) 

Py  =  °)  (8) 

respectively,  while  the  continuity  equation  (5), 

'U'X  4“  Vy  —  0,  (9) 

remains  unchanged.  Furthermore,  the  constitutive  relation  (6)  can  be  simplified  to 


f  T  =  HUy  +  TySgn(v,y)  (|f I  >  Ty ), 

l  Uy  =  0  (jfj  <  Ty), 


(10a,  b) 


where  the  direction  of  deformation  is  given  simply  by  the  sign  of  the  vertical  shear  sgn('u^). 
The  reduced  equations  (7)-(8)  represent  a  balance  between  the  divergence  of  vertical  shear 
stress  and  the  horizontal  gradient  of  the  pressure,  which  is  independent  of  the  vertical 
coordinate  at  each  x-location. 

For  retrograde  snail  locomotion,  equations  (7)-(10)  are  subject  to  the  following  no-slip 
conditions  on  the  upper  and  lower  surfaces 

u  =  U(t)  (y  =  Y),  (11) 

u  =  0  (y  =  0),  (12) 


respectively.  Furthermore,  the  equation  for  the  pressure  at  the  foot’s  surface  (1),  combined 
with  continuity  of  normal  stress,  implies  that  the  fluid  pressure  is  given  by 

p  =  Af  +  DRY  ( y  =  Y ),  (13) 


to  leading  order  in  the  lubrication  approximation,  where  we  have  absorbed  the  constant 
DrH  into  the  pressure.  Equation  (8)  implies  that  the  pressure  given  by  (13)  applies 
throughout  the  vertical  extent  of  the  layer  at  each  x-location.  In  the  following  analysis, 
we  assume  a  sinusoidal  muscular  forcing  of  the  form 

/  =  sin[/c(x  +  ct)}  =  sin  (fc|).  (14) 

Noting  that  this  forcing  can  be  considered  steady  with  respect  to  the  variable  £,  we  proceed 
to  use  £  in  favour  of  the  x-coordinate.  Thus,  we  recast  the  system  in  terms  of  (£,t),  and 
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transform  partial  derivatives  according  to 


d_  d_ 

dx 

d  d  d 


(15) 

(16) 


We  proceed  to  assume  that  all  variables  in  the  system  are  2-7r-periodic  in  k£. 

On  the  assumption  that  the  only  surface  force  applied  on  the  main  body  of  the  swimmer 
is  from  the  mucus  layer,  we  can  derive  an  equation  for  the  acceleration  of  the  main  body  of 
the  snail  in  terms  of  the  tangential  force  applied  by  the  mucus  along  the  underside  of  the 
foot, 

-  dfl  k  f - 

MS=^L  +  (17) 

where  M  represents  the  mass  of  the  main  body  of  the  snail  per  unit  wavelength,  tq  = 
f(£,  0 ,  t),  fs  =  r(£,  Y,t),  and  the  angled  brackets  denote  the  average  ($)  =  ^  4>  d£. 

The  second  equality  in  (17)  is  derived  by  applying  integration  by  parts  to  the  pressure  term, 
and  using  equations  (7)-(8)  to  rewrite  the  resulting  integrand  in  terms  of  the  stress  at  the 
lower  surface. 

We  can  derive  an  evolution  equation  for  the  thickness  profile  by  depth-integrating  the 
continuity  equation  (24)  over  the  thickness  of  the  layer,  to  yield 


dY  dq 


or  in  terms  of  (£,  t), 

dY  dY  dq 

— 7T  H - 7T  H - ~  —  0, 

dt  d£  d£ 

where  the  flux  is  defined  by  q(£,  t)  =  fy  u  d y. 

We  non-dimensionalize  the  system  above  according  to 


«-*«■ 

y  -- 

=  Hy, 

Y  =  HY, 

t  =  1 T4’ 
kc 

u  =  cu, 

u 

=  cU , 

s> 

II 

yc 

P  -  kH 2 

(18) 


(19) 


(20) 


where  dimensionless  variables  are  denoted  without  the  hat  symbol.  In  terms  of  dimensionless 
variables,  equations  (7)-(8)  become 


Tv=Pt>  (21) 

p  =  Asin^  +  DY,  (22) 
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respectively,  and  the  constitutive  relation  (6)  becomes 


f  T  =  Uy  +  Bsgn(uy)  (|r|  >  B), 

I  %  =  0  (\t\  <  B). 

The  equations  of  average  thickness  and  depth-integrated  continuity  become 


(23a,  b) 


00  =  1,  Yt  +  Yt  +  qs  =  0, 


(24) 


the  boundary  conditions  ( 1 1)— (12)  become 

(25) 

(26) 


u  =  U(t )  (y  =  Y), 

u  =  0  (y  =  0), 


and  the  equation-of-motion  of  the  body  (17)  becomes 

&  =  i 7<To>’ 

where 

„  kH2A  kH3DR  Hty  HckM 

A  = - ,  D  =  - B= - -,  M  = - 

fj,c  yc  pc  /j, 


(27) 


(28) 


are  the  non-dimensional  amplitude,  non-dimensional  stiffness,  Bingham  number,  and  non- 
dimensional  mass  of  the  main  body  per  unit  wave  length,  respectively.  The  Bingham  number 
can  be  considered  a  non-dimensional  yield  stress. 

The  system  above  defines  an  initial- value  problem  for  the  evolution  of  the  free  boundary 
Y(£,t)  and  the  locomotion  velocity  U(t).  To  solve  it,  we  can  integrate  equations  (21)-(23) 
subject  to  the  boundary  conditions  (26)-(25)  for  the  velocity  profile,  given  Y (£,  t)  and 
U(t)  at  each  point  in  time.  With  the  velocity  profiles  in  hand,  we  evolve  the  thickness  in 
time  using  equation  (24),  and  the  locomotion  velocity  using  equation  (27),  in  which  the 
surface  stress  tq  is  established  as  part  of  the  integration  for  the  velocity  profiles.  The  main 
complication  in  the  above  procedure  is  the  construction  of  the  velocity  field  for  a  given 
Y(x,t )  and  U(t),  which  we  proceed  to  detail  in  the  following  section. 


2.3  Construction  of  the  flow  profiles 

To  begin,  we  can  integrate  equation  (21)  with  respect  to  y  to  yield 

T  =  T0+p?y,  (29) 

where  To  is  an  as  yet  undetermined  function  of  £.  This  expression  for  the  stress  applies 
in  any  yielded  parts  of  the  flow.  Combining  equation  (29)  with  the  component  of  the 
constitutive  relation  (23a)  associated  with  the  yielded  regions,  provides  an  equation  for  the 
velocity  profile  in  the  yielded  regions, 

uy  =  r0  +  p^y  -  Bsgn(uy).  (30) 
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Integrating  this  equation  provides 


u  =  \P&2  +  N  “  Bs&n (uy)\y  +  K, 


(31) 


where  K  is  a  constant  of  integration,  which  shows  that  the  velocity  profiles  in  the  yielded 
regions  of  the  flow  are  parabolae. 

As  part  of  the  solution  of  the  velocity  profiles,  it  is  also  necessary  to  determine  the 
locations  of  any  yield  surfaces  where  |r|  =  B,  which  provide  the  boundaries  of  any  unyielded 
regions  of  the  flow  governed  by  (23b).  We  can  identify  two  potential  yield  surfaces  y  =  r/± 
by  setting  t  =  ±B  in  equation  (29)  to  give 


V± 


to  B 
Pi  N’ 


(32) 


where  the  modulus  of  the  pressure  gradient  has  been  used  in  the  second  term  on  the  right- 
hand  side  to  ensure  that  ??+  >  rj-  independently  of  the  sign  of  the  pressure  gradient.  We 
refer  to  rj±  as  potential  yield  surfaces  because  it  is  possible  for  one,  or  both,  of  r/±  to  he 
outside  the  domain  of  the  fluid.  Equation  (32)  implies  that  there  exists  a  region  <  y  <  7/4- 
in  which  the  velocity  is  independent  of  the  vertical  coordinate.  Within  these  regions,  the 
flow  is  dominated  by  the  yield  stress  to  leading-order  in  lubrication  theory  and  thus  exhibits 
plug-like  characteristics  in  the  vertical  direction.  We  remark  that  these  regions  are  not  true 
rigid  plugs  unless  their  velocity  is  also  independent  of  the  ^-coordinate.  Thus,  we  refer  to 
regions  bounded  by  the  yield  surfaces  as  pseudo-plugs  if  they  are  not  strictly  rigid. 

By  adding  and  subtracting  the  expressions  for  the  potential  yield  surfaces  (32)  we  deduce 
that 

^  2r°  2B  m 

Pi  m 

respectively.  The  latter  of  these  equations  provides  the  width  of  the  pseudo-plug  if  it  lies 
fully  inside  the  domain,  or  a  maximum  for  its  width  if  it  intersects  one  of  the  side  walls.  The 
former  can  be  used  to  relate  the  positions  of  the  potential  yield  surfaces  to  the  unknown  tq. 

As  mentioned  above,  it  is  not  necessarily  the  case  that  the  potential  yield  surfaces 
given  by  equation  (32)  lie  inside  the  domain  of  the  fluid  0  <  y  <  Y .  For  example,  if 
77+  >  0  and  0  <  rj-  <  Y,  then  there  exists  a  pseudo-plug  in  the  region  r)^  <  y  <  Y 
that  intersects  the  upper  surface.  In  this  case  the  velocity  of  the  pseudo-plug  is  equal 
to  the  velocity  of  the  side  wall  it  is  in  contact  with  and  the  construction  of  the  velocity 
profile  is  different  to,  for  example,  the  regime  in  which  the  pseudo-plug  lies  fully  inside  the 
domain.  Thus,  the  different  arrangements  arising  from  whether  or  not  one  or  both  of  the 
potential  yield  surfaces  lie  inside  the  domain  require  consideration  on  a  case-by-case  basis. 
We  summarize  the  equations  for  the  five  possible  arrangements  in  appendix  A.  For  the 
purposes  of  demonstration,  we  proceed  to  construct  the  velocity  profile  for  regime  C,  in 
which  the  potential  locations  of  the  yield  surfaces  lie  inside  the  boundary  of  the  domain 
0  <  r}±  <  Y. 

We  note,  by  substituting  r  =  ±B  into  the  constitutive  relation  (23a),  that 


uy  =  0  (y  =  rj±), 


(34) 
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on  the  boundaries  of  the  pseudo-plugs,  i.e.  the  vertical  gradient  of  the  velocity  is  continuous 
at  the  potential  yield  surfaces.  Combining  this  condition  with  the  equation  for  the  parabolic 
profiles  in  the  yielded  regions  given  by  (31),  we  can  write  down  a  profile  of  the  form 


u  = 


u. 

n 


v  +  \pdy  ~  p+f 


Up  +  \pn{y  -  r]-Y 


(v+  <  y  <  Y), 
(v-  <  y  <  v+), 
(o  <  y  <  V-), 


(35) 


where  the  plug  velocity  up  =  —  ^p^rj^L  is  established  on  applying  condition  (26).  We  deter¬ 
mine  the  remaining  unknown  in  the  profile  by  applying  the  no-slip  condition  at  the  upper 
interface  (25)  to  give 

U  =  ^[(Y-rl+)2-712_}.  (36) 

Using  equation  (33b)  to  eliminate  r/_  in  this  equation,  and  rearranging  for  the  location  of 
the  upper  yield  surface,  provides 


p\Y2  -  2 Upt:  -  4 Bz 


y+  =  - 


2\pz\(Y\pt\-2B) 


The  lower  yield  surface  is  then  given  by 

V-  =  V+ 


2  B 

W’ 


(37) 


(38) 


using  equation  (33b).  Having  established  the  velocity  profile  (35),  we  can  now  derive  the 
flux 

Q=  \Pi\{Y  ~  y+f  +  rf- ~  ZP-Y],  (39) 

which  can  be  used  in  equation  (18)  to  find  the  rate-of-change  of  the  thickness  profile  at 
each  ^-location.  For  the  evolution  of  the  locomotion  velocity,  we  calculate  the  integral  on 
the  right-hand  side  of  equation  (27)  using  the  expression 


to  =  lpz(rj+  +  r]-), 


(40) 


which  is  given  by  rearranging  equation  (33a). 

As  shown  in  appendix  A,  different  regimes  have  different  expressions  for  the  potential 
yield  surfaces.  Thus,  in  constructing  the  flow  profile  we  determine  the  regime  at  a  particular 
^-location  by  calculating  the  potential  yield  surfaces  in  all  regimes  and  then  choose  the  one 
that  is  consistent  with  the  regime  definition.  Note  that  although  the  potential  yield  surfaces 
for  regimes  A  and  E  are  not  used  in  the  expressions  for  the  velocity  profiles  or  fluxes,  they 
are  still  relevant  in  the  equation  for  tq  given  by  (40),  which  we  use  to  evaluate  the  right-hand 
side  of  equation  (27). 

2.4  Initial-value  problem 

We  solve  the  initial-value  problem  described  by  equations  (21)-(27)  numerically  by  applying 
the  Matlab  time-integrator  ode  15s,  and  using  fast  Fourier  transforms  to  evaluate  spatial 
derivatives.  At  each  step  of  the  time  integration,  we  evaluate  the  pressure  held  using 
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Figure  3:  Locomotion  velocity  as  a  function  of  time  in  the  cases  (a)  A  =  1,  B  =  0.1,  D  =  1, 
M  =  1,  (b)  A  =  1,  B  =  0.14,  D  =  1,  M  =  1,  (c)  A  =  1,  B  =  0.1,  D  =  3,  M  =  1,  and  (d) 
A  =  1,  B  =  0.1,  D  =  0,  M  =  1. 

equation  (22),  construct  the  flow  profiles  using  the  formulae  in  appendix  A,  and  use  the 
resulting  flux  to  determine  the  rate-of-change  of  the  thickness  profile  using  equation  (18). 
Further,  we  use  the  formulae  for  rj±  given  in  appendix  A  to  evaluate  the  surface  stress  to 
using  equation  (40),  which  we  then  use  to  determine  the  rate-of-change  of  the  locomotion 
velocity  using  equation  (27).  Our  solutions  for  the  evolution  of  the  locomotion  velocity  in 
the  four  cases  (a)-(d)  defined  by 


(a) 

A  =  1,  5  =  0.1,  5  =  1,  M  =  1 

(b) 

A  =  1,  B  =  0.14,  D  =  1,  M  =  1 

(c) 

A  =  1,  B  =  0.1,  D  =  3,  M  =  1 

(d) 

A  =  1,  B  =  0.1,  D  =  0,  M  =  1 

in  which  we  apply  the  initial  conditions 

U  =  0  }  (i  =  0);  (41) 

are  shown  in  figure  3.  The  corresponding  evolutions  of  the  thickness  profiles  in  cases  (a) 
and  (b)  are  shown  in  the  space-time  plots  of  figure  4. 
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Figure  4:  Space-time  plots  of  the  evolution  of  the  thickness  profile  Y (£,  t)  for  cases  (a) 
A  =  1,  B  =  0.1,  D  =  1,  M  =  1,  and  (b)  A  =  1,  B  =  0.14,  D  =  1,  M  =  1. 


337 


For  case  (a),  we  observe  an  initial  transient  during  which  the  locomotor  remains  station¬ 
ary  for  a  short  period  after  its  muscles  are  activated.  Although  the  locomotor  is  stationary 
during  this  initial  phase,  its  thickness  profile  is  strongly  time-dependent,  as  indicated  by  the 
space-time  plot  in  figure  4(a).  At  t  ~  1.2,  the  locomotor  rapidly  accelerates  to  a  maximum 
velocity  in  the  first  half  of  the  forcing  period,  and  subsequently  decelerates  back  to  U  =  0. 
After  completing  a  secondary  phase  of  no  locomotion,  the  locomotor  subsequently  remains 
in  forward  motion  with  a  velocity  that  converges  toward  a  steady  value.  The  oscillation 
of  the  locomotion  velocity  is  due  to  a  periodic  change  in  the  direction  of  the  tangential 
stress  on  the  locomotor  due  to  the  muscular  force  wave  as  it  propagates  along  the  foot.  By 
comparing  solutions  (a)  and  (c),  we  observe  that  a  larger  stiffness  has  the  effect  of  reducing 
the  amplitude  of  the  oscillation.  In  case  (d),  in  which  the  foot  is  perfectly  inelastic,  the 
locomotor  does  converge  quickly  toward  a  steady  state,  and  appears  to  remain  in  a  neutrally 
stable  oscillation  for  all  time. 

In  the  limit  of  large  stiffness  and  large  amplitude,  the  dominant  balance  in  the  pressure 
equation  (22)  is  between  the  muscular  forcing  and  the  stress  due  to  elastic  deformation, 
which  recovers  the  case  of  imposing  the  displacement  of  the  foot  surface.  In  this  case  it  is 
clear  that  increasing  the  stiffness  has  the  same  effect  as  reducing  the  amplitude.  Thus,  one 
could  hypothesize  that  increasing  the  stiffness  always  reduces  the  steady-state  locomotion 
velocity.  However,  we  observe  that  the  steady-state  velocity  of  case  (c)  is  greater  than  that 
of  case  (a).  This  suggests  that  it  is  possible  to  improve  locomotion  by  using  a  stiffer  material 
where  the  resultant  force  due  to  the  muscular  forcing  and  elastic  stress  is  more  efficient  at 
creating  the  tangential  stresses  on  the  lower  surface  of  the  foot  that  leads  to  locomotion. 

The  observed  regions  of  no  locomotion  can  be  explained  on  the  basis  that  after  its  inertia 
vanishes,  the  locomotor  must  wait  until  the  muscular  and  elastic  forces  can  break  the  yield 
stress  at  each  £- locomotion  in  order  to  restart.  In  case  (b),  which  has  a  greater  Bingham 
number  than  case  (a),  we  observe  that  the  locomotor  goes  through  a  similar  initial  transient 
to  that  of  case  (a)  but  fails  to  restart  on  the  second  oscillation,  subsequently  converging 
to  a  steady  state  with  no  locomotion.  Thus,  we  can  identify  a  critical  Bingham  number 
B*(A,D )  that  lies  between  the  Bingham  numbers  of  cases  (a)  and  (b),  which  defines  the 
value  above  which  no  locomotion  occurs  in  the  steady  state.  We  observe  from  the  space- 
time  plot  of  figure  4(b)  that  after  the  locomotor  becomes  stationary,  its  thickness  profile 
continues  to  evolve  in  time  and  converges  toward  a  steady-state  in  a  similar  manner  to  that 
of  case  (a).  Curiously,  the  locomotor  in  case  (b)  has  been  able  to  translate  forwards  despite 
it  coming  to  a  standstill  in  the  steady  state,  which  suggests  that  it  could  still  be  able  to 
locomote  by  reducing  its  amplitude  over  time  and  then  suddenly  restarting  it  at  periodic 
intervals.  We  proceed  to  focus  our  attention  on  the  eventual  steady  states  of  the  system,  in 
which  we  identify  the  value  of  the  critical  value  B*. 

2.5  Steady  state 

The  steady  states  of  the  system  can  be  established  directly  by  considering  the  equations 
in  the  case  where  the  time-derivatives  are  set  to  zero.  In  this  case,  the  thickness  evolution 
equation  (19)  becomes 

■i~(Y  +  q)=0,  (42) 
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Figure  5:  Steady-state  profiles  in  cases  (a)  A  =  1,  B  =  0.1,  D  =  1,  and  (b)  A  =  1,  B  =  0.14, 
D  =  1.  Regions  of  unyielded  fluid  are  shown  in  blue. 
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Figure  6:  Steady-state  locomotion  velocity  (red),  flux  (blue),  and  flux  with  a  fixed  locomotor 
(green),  which  is  relevant  to  the  process  of  peristalsis,  for  different  values  of  the  Bingham 
number,  in  the  case  A  =  1,  D  =  1. 


which  integrates  to  give 


Y  +  q  =  l-Q, 


(43) 


where  Q  is  the  steady-state  flux  in  the  negative  ^-direction.  This  equation  defines  an  eigen¬ 
value  problem  for  the  unknown  steady-state  thickness  profile  Y,  and  the  two  eigenvalues  Q 
and  U,  for  which  we  apply  the  two  constraints 


(Y)  =  (44) 

(to)  =  0,  (45) 

given  by  the  mean  thickness  condition  (24)  and  the  steady  form  of  (27),  respectively.  We 
solve  this  eigenvalue  problem  numerically  using  a  Newton- Raphson  iteration  scheme.  Noting 
that  trial  profiles  converge  rapidly  to  the  Newtonian  case  B  =  0,  we  initialize  the  scheme 
with  the  Newtonian  case  and  progressively  increment  B  in  sufficiently  small  steps  to  allow 
convergence  at  each  step. 

Our  solutions  for  the  steady-states  in  cases  (a)  and  (b)  are  shown  in  figure  5,  and  the 
relationships  between  the  locomotion  velocity,  flux  and  Bingham  Number  in  the  case  A  =  1, 
D  =  1,  are  shown  in  figure  6.  In  case  (b),  which  lies  in  B  >  B*,  there  is  no  locomotion  in  the 
steady  state.  We  see  from  5(b)  that  the  thickness  profile  becomes  raised  and  depressed  near 
the  points  £  =  0  and  £  =  vr,  respectively,  which  correspond  to  regions  where  the  pressure 
gradient  attains  its  maximum  and  minimum  values.  Furthermore,  the  yield  surfaces  touch 
both  the  upper  and  lower  boundaries  close  £  =  ir/2  and  £  =  3ir/2,  which  are  regions  where 
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the  pressure  gradient  becomes  close  to  zero  and  subsequently  changes  sign.  The  plugs  in 
these  regions  span  the  vertical  extent  of  the  layer,  which  explains  why  the  locomotor  must 
remain  stationary.  Although  there  is  no  locomotion,  there  is  still  fluid  motion  in  the  yielded 
parts  of  the  C-regions,  which  leads  to  a  net  pumping  of  fluid  in  the  negative  ^-direction. 
Note  that  although  q  =  0  in  the  rigid  plugs,  it  is  the  steady  flux  in  the  wave  frame  q  +  Y 
that  is  uniform  in  the  above  solutions,  so  it  clear  that  a  net  transport  of  fluid  is  possible 
with  fluid  crossing  the  yield  surfaces.  In  cases  such  as  this  in  which  the  locomotor  does 
not  move,  we  can  think  of  the  system  as  being  equivalent  to  that  of  a  peristaltic  pump,  in 
which  a  condition  of  no  horizontal  translation  on  the  upper  surface  would  be  imposed  at  all 
times. 

We  observe  from  figure  6  that  the  flux  eventually  vanishes  at  another  critical  value  of 
the  Bingham  number.  This  can  be  verified  by  noting  that  fluid  motion  stops  when  the  yield 
surfaces  of  the  C-regions  touch  the  boundaries  of  the  domain  everywhere,  which  first  occurs 
when  =  1.  Thus,  by  setting  U  =  0  in  equation  (88)  of  appendix  A,  we  deduce  that 

Jr  +  lr1’  <46> 

at  £  =  7T,  on  simplification.  Further,  noting  that  Y  is  uniformly  equal  to  one  in  the  fully 
unyielded  steady-state  system,  equation  (22)  implies  that  the  pressure  gradient  at  this  point 
is  given  by  p^( n)  =  —A.  Substituting  this  value  into  equation  (46)  provides  the  critical  value 

B  =  \A.  (47) 

The  steady  state  profile  for  case  (a),  displayed  in  figure  5(a),  shows  that  the  pseudo¬ 
plugs  near  £  =  7r/2  and  £  =  3tt/2  do  not  fully  span  the  vertical  extent  of  the  layer.  Thus, 
the  locomotor  is  able  to  break  the  yield  stress  at  each  ^-location,  leading  to  a  positive 
locomotion  velocity.  Figure  6  shows  that  the  locomotion  velocity  increases  as  the  Bingham 
number  is  reduced  from  B*,  with  slow  growth  in  the  vicinity  of  B*,  and  approximately  linear 
growth  near  the  Newtonian  case  B  =  0.  In  figure  6,  we  also  compare  the  corresponding  flux 
in  the  steady  state  to  that  of  the  peristaltic  case,  which  shows  that  as  the  Bingham  number 
is  decreased  from  B*,  the  flux  soon  starts  to  decrease,  but  the  flux  in  the  peristaltic  case 
continues  to  increase. 


3  Prograde  locomotion 
3.1  The  foot 

The  defining  difference  between  retrograde  and  prograde  locomotion  is  given  by  whether 
the  resultant  forcing  applied  by  the  foot  muscle  is  predominantly  tangential  or  normal  to 
the  surface  of  the  foot.  For  prograde  locomotion,  we  assume  that  the  contraction  pattern 
of  the  inclined  muscular  fibres  in  the  snail’s  foot  is  such  as  to  provide  a  predominantly 
tangential  resultant  force  at  the  base  of  the  foot.  In  this  case  equations  (1) — (3),  which  were 
relevant  in  the  retrograde  case,  no  longer  apply.  Assuming  that  the  main  body  of  the  snail 
acts  as  a  lubricated  rigid  foundation  that  does  not  allow  vertical  penetration  but  does  allow 
tangential  slip  with  a  no-stress  condition,  it  can  be  shown  that  the  elastic  deformation  of 
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Figure  7:  Schematic  of  the  foot  and  mucus  layer  in  prograde  locomotion. 


the  foot  is  predominantly  horizontal  to  leading-order  in  d  <C  L.  Thus,  we  assume  that  the 
base  of  the  foot  can  undergo  significant  tangential  contraction  so  that,  unlike  the  retrograde 
case,  we  do  not  impose  the  condition  that  the  base  of  the  foot  moves  with  the  locomotion 
velocity  (3).  Instead,  we  impose  a  tangential-stress  condition, 

r  =  Af(x,  t)  +  Dpiisx,  (48) 

where  Af  is  the  tangential  muscular  forcing,  A  is  the  amplitude  of  the  forcing,  and  the 
stiffness  coefficient  Dp  can  be  related  to  the  elastic  properties  of  the  foot.  The  latter  term 
on  the  right-hand  side  of  this  equation  represents  the  elastic  force  derived  from  the  relative 
compression  of  neighbouring  points  on  the  foot’s  surface.  Thus,  we  can  think  of  the  foot  as 
a  large  collection  of  vertical  sections  joined  by  Hookean  springs,  as  illustrated  in  figure  7. 
Motivated  by  observations,  we  assume  a  travelling- wave  forcing  of  the  form 

/  =  f(K),  (49) 

where  £  =  x  —  ct.  For  positive  wave  speed  c,  this  corresponds  to  a  wave  that  propagates 
in  the  positive  x-direction,  chosen  in  anticipation  that  this  will  lead  to  locomotion  in  the 
positive  x-direction. 

The  absence  of  significant  vertical  deformation  implies  that  we  can  assume  that  the 
surface  of  the  foot  remains  in  its  vertical  equilibrium  position 

Y  =  H,  (50) 

at  all  times.  Assuming  that  the  muscular  fibres  of  the  foot  are  anchored  inside  the  main 
body  of  the  snail,  we  can  define  the  locomotion  velocity  of  the  main  body  of  the  snail  as 
the  average  of  the  surface  velocity, 

U=(ua).  (51) 

The  equations  above  describe  the  mechanics  of  a  snail’s  foot  undergoing  prograde  locomo¬ 
tion.  We  proceed  to  consider  the  dynamics  of  the  mucus  layer  due  to  the  new  conditions 
(48)-(51),  and  highlight  the  differences  to  the  retrograde  case. 
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3.2  The  mucus  layer 

In  the  prograde  case,  the  mechanics  of  the  mucus  layer  is  complicated  by  the  fact  that  there 
is  no  longer  a  simple  expression  for  the  pressure,  but  is  simplified  by  the  fact  that  it  is 
no  longer  necessary  to  evolve  the  thickness  profile  in  time.  The  equations  of  force  balance 
and  continuity  (7)-(10)  are  equally  relevant  in  the  prograde  case,  but  we  no  longer  assume 
the  surface  velocity  condition  (11)  or  the  pressure  condition  (13).  Instead,  equation  (48) 
combined  with  continuity  of  tangential  stress  at  the  foot’s  surface,  provides  a  shear-stress 
condition 

t  =  ts  =  Af  +  Dpus£  ( y  =  H ).  (52) 

We  can  non-dimensionalize  the  relevant  equations  (7),  (8),  (10),  (12),  (19),  (50)  and 
(52)  according  to 


(53) 


€  =  y  =  Hv, 

HA  j 

u  =  - u,  t  =  At, 

y 

where  dimensionless  variables  are  denoted  without  the  hat 
dimensional  equations  of  force  balance  are 

Tv  =  P 
Py  =  0, 


Y  =  HY , 


A 

p  =  mp ’ 


f  T  =  Uy  +  Rsgn(uy)  (|t|  >  R), 

\  Uy  =  0  (|t|  <  R), 


with  constitutive  relation, 


and  boundary  conditions, 


r  =  Ts  =  f  +  Dusi  (y  =  1), 
u  =  0  (y  =  0). 


symbol.  The  resulting  non- 

(54) 

(55) 

(56a,  b) 

(57) 

(58) 

(59) 

(60) 


The  thickness  and  depth-integrated  continuity  equations  are  given  by 

Y  =  1,  q£  =  0, 

respectively,  where  in  the  equations  above, 

r=X,  o  =  ™r, 

A  y 


are  the  yield-stress-amplitude  ratio,  and  the  non-dimensional  stiffness,  respectively.  The 
non-dimensionalization  above  shows  that  the  absence  of  a  scaling  between  the  rate-of- change 
of  the  thickness  profile  and  the  divergence  of  flux  in  the  depth-integrated  continuity  equation 
(59)  implies  that  unlike  the  retrograde  case,  it  has  been  possible  to  incorporate  the  yield 
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stress  and  amplitude  into  a  single  parameter  R.  We  remark  that  the  limit  of  large  stiffness 
recovers  the  case  of  imposing  the  tangential  displacement  of  the  foot’s  surface. 

3.3  Eigenvalue  problem 

The  depth-integrated  continuity  equation  (59)  can  be  integrated  to  give 


q  =  Q,  (61) 

where  Q  is  the  flux  of  fluid  in  the  positive  ^-direction.  Thus,  the  prograde  system  has  been 
reduced  to  an  eigenvalue  problem  for  the  unknown  pressure  gradient  p £  and  eigenvalue  Q , 
that  we  solve  subject  to  the  periodicity  constraint 


(P«)  =  0.  (62) 

As  in  the  retrograde  case,  the  main  complication  is  in  the  construction  of  the  velocity 
profiles  when  a  yield  stress  is  present.  However,  before  discussing  the  effects  of  yield  stress, 
we  begin  by  considering  the  simpler  Newtonian  case. 

Newtonian  solution 

In  the  Newtonian  case  R  =  0,  we  can  relate  the  flux  Q  and  locomotion  velocity  U  to  the 
forcing  function  analytically.  To  show  this,  we  begin  by  integrating  equation  (70),  with  the 
constitutive  relation  (56)  with  R  =  0,  to  give  the  parabolic  velocity  profile 

u  =  \p^V2  +  (ts  -  p^)y,  (63) 

in  which  we  have  applied  the  condition  of  no-slip  at  the  base  (58).  The  associated  formula 
for  the  flux  is  then  given  by 

Q  =  \ts  -  \vi  =  Q ,  (64) 

which  is  uniformly  equal  to  Q  according  to  (61).  Taking  the  ^-average  of  this  equation 
yields 

Q  =  (65) 

on  applying  the  continuity  constraint  (62).  Substituting  the  expression  for  the  surface  stress 
(57)  gives 

Q  =  lif  +  Dust)  =  1(f),  (66) 

noting  that  the  average  of  the  elastic  term  vanishes.  Thus,  we  have  established  the  flux  in 

terms  of  the  imposed  tangential  forcing.  To  determine  the  associated  locomotion  velocity, 
we  evaluate  the  velocity  of  the  fluid  at  the  surface  by  setting  y  =  1  in  equation  (63)  to  give 

Us  —  ts  }2p^.  (67) 

Now,  taking  the  ^-average  of  this  equation  yields 

U  =  ( us )  =  (ts)  =  (f  +  Dust)  =  (/)>  (68) 
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on  noting  again  that  the  average  of  the  elastic  term  is  zero. 

Expressions  (66)  and  (68)  show  that  both  the  flux  and  the  locomotion  velocity  are 
linearly  related  to  the  mean  value  of  the  imposed  forcing  function  /.  Thus,  we  are  motivated 
in  our  analysis  of  the  non-Newtonian  case  to  prescribe  a  forcing  with  a  non-zero  mean.  In 
this  regard,  we  proceed  to  impose 

/  =  sin2  £,  (69) 

which  has  mean  value  (/)  =  We  remark  that  in  adopting  this  forcing,  it  is  not  possible 
to  balance  the  tangential  stress  exerted  on  the  main  body  of  the  snail  without  the  snail 
placing  the  front  or  back  of  its  foot  in  contact  with  the  ground  to  provide  a  frictional  force 
to  balance  that  of  the  muscular  contractions.  The  resulting  process  of  locomotion  is  similar 
to  that  of  an  inchworm,  which  is  known  to  be  adopted  occasionally  by  some  land  snails. 

As  a  side  note,  we  remark  that  it  is  possible  to  locomote  in  the  case  (/)  =  0  if  a  yield 
stress  is  present.  To  demonstrate  this,  consider  a  forcing  pattern  with  small  regions  of  high 
magnitude  forcing  in  the  positive  ^-direction,  and  with  larger  regions  of  low  magnitude 
forcing  in  the  negative  ^-direction,  constructed  with  (/)  =  0,  such  that  only  regions  of  the 
forcing  directed  in  the  direction  of  locomotion  will  break  the  yield  stress.  Then  it  is  clear 
that  the  velocity  of  the  foot  surface  will  be  either  stationary  or  directed  only  in  the  positive 
^-direction.  The  mean  surface  velocity  will  thus  be  positive,  providing  locomotion.  Such 
a  pattern  may  correspond  to  that  adopted  commonly  by  land  snails,  in  cases  where  the 
surface  of  the  foot  does  not  make  direct  contact  with  the  ground. 

The  effect  of  yield  stress 

The  effect  of  yield  stress  can  be  established  by  considering  the  eigenvalue  problem  defined 
by  equations  (61)-(62),  with  the  forcing  (69),  in  the  general  case  R  >  0.  To  simplify  the 
analysis,  we  assume  D  =  0  with  the  expectation  that  the  resulting  system  will  encapsu¬ 
late  the  behaviour  of  the  general  case.  We  solve  the  eigenvalue  problem  for  the  unknown 
pressure  gradient  numerically  using  a  Newton-Raphson  iteration  scheme  initialized  using 
the  Newtonian  solution  discussed  above.  In  each  iteration,  we  construct  the  flow  profiles 
with  the  pressure  gradient  form  the  previous  step  using  an  analogous  procedure  to  that 
outlined  in  section  2.3.  The  first  step  of  the  integration  of  the  flow  profiles  can  be  achieved 
by  integrating  (54)  subject  to  (57)  to  give 

t  =  f  +  pd1  ~  y)> 

from  which  we  deduce  that  the  potential  yield  surfaces  are  given  by 

/  R 

V±  =  l - ±  | — 7? 

Pi  m 

in  all  regimes.  The  other  formulae  associated  with  the  construction  of  the  velocity  profiles 
are  summarized  in  appendix  B.  For  our  solution,  we  plot  the  dependence  of  the  locomotion 
velocity  and  flux  on  the  yield-stress-amplitude  ratio  R  in  figure  8,  and  some  examples  of 
yield-surface  plots  and  pressure-gradient  profiles  for  specific  values  of  R  are  shown  in  figures 
9  and  10,  respectively. 

In  all  cases  of  R,  we  can  define  a  region  of  the  fluid  |£|  <  (d,  where  (  =  £  —  7t/2,  and 


(71) 


(70) 
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Figure  8:  Locomotion  velocity  and  flux  in  prograde  locomotion  (a),  and  a  blow-up  (b)  of 
the  region  R*  <  R  <  1,  in  which  we  plot  the  asymptotic  approximation  (80)  for  U. 
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(d  is  defined  by 


R  =  cos2(d,  (72) 

in  which  the  fluid  in  contact  with  the  foot  must  be  yielded  because  the  imposed  tangential 
forcing  at  these  points  exceeds  the  yield  stress.  Conversely,  the  region  of  the  fluid  |£|  > 
in  contact  with  the  foot  y  =  1  must  be  unyielded.  In  order  to  provide  a  non-zero  flux, 
it  is  necessary  that  the  structure  of  the  yield  surfaces  is  such  that  there  is  a  pseudo-plug 
extending  from  the  top  surface  and  a  flow  near  the  lower  surface,  corresponding  to  regime 
D  in  appendix  B;  this  is  evident  in  the  example  profiles  of  figure  9.  Furthermore,  we  can 
deduce  from  equation  (70)  that  the  pressure  gradient  is  always  negative  in  the  D-region, 

I C|  >  Cd,  and  positive  elsewhere,  which  is  verified  by  the  pressure-gradient  plots  of  figure 
10. 

The  case  R  =  0.2,  shown  in  figure  9(a),  is  fully  yielded  except  in  the  pseudo- plugs  of 
the  D-regions  discussed  above.  As  R  is  increased,  there  is  a  critical  point  R  ~  0.253,  where 
a  pseudo-plug  emerges  from  the  base  at  £  =  0.  This  B-region,  shown  in  figure  9(b)  for  the 
case  R  =  0.6,  proceeds  to  grow  as  R  is  further  increased.  We  observe  from  figure  8  that 
the  flux  remains  approximately  half  the  locomotion  velocity  in  the  region  0  <  R  <  0.4  and 
beyond  this  point,  the  ratio  of  the  flux  to  the  locomotion  velocity  begins  to  decrease  at 
a  much  faster  rate.  At  R  ~  0.762,  a  C-region  appears  inside  the  B-region  with  a  yielded 
region  emerging  at  the  base,  and  a  groove  in  the  upper  yield  surface,  as  shown  in  figure 
9(c)  for  the  case  R  =  0.8.  At  this  stage,  the  regime  progression  across  the  period  can  be 
summarized 

D— >E— >A— >B— >C— >B— >A— >E— >D,  (73) 

where  the  letters  correspond  to  the  regime  definitions  given  in  appendix  B.  The  fully- 
yielded  A-  and  E-regions,  which  are  too  narrow  to  be  discerned  clearly  in  figure  9(c),  lie 
near  locations  where  the  pressure  gradient  changes  sign,  as  can  be  verified  by  plot  (c)  in 
figure  10.  As  R  is  increased  further,  there  is  a  critical  point  R*  where  the  pseudo-plugs  in 
the  D-regions  touch  the  base  and  become  rigid  plugs  that  span  the  thickness  of  the  layer. 
At  this  point,  the  flux  vanishes  and  the  velocity  of  the  foot’s  surface  us  becomes  zero  within 
the  region  |£|  >  C,d-  However,  the  remaining  motion  of  the  foot  in  the  C-region  |£|  <  C,d 
is  still  able  to  drive  a  small  locomotion  velocity.  The  behaviour  of  the  flux  and  locomotion 
velocity  for  R*  <  R  <  1  is  shown  in  figure  8(b). 

The  onset  of  locomotion 

It  is  clear  that  there  is  no  fluid  motion  if  R  >  1  because  in  this  case  the  shear  stress  does 
not  exceed  the  yield  stress  anywhere  in  the  fluid.  As  mentioned  above,  a  C-region  emerges 
about  (  =  0  as  R  is  decreased  from  1,  such  that  although  the  fluid  flux  remains  zero,  the  foot 
is  still  able  to  drive  locomotion.  This  can  be  contrasted  with  the  retrograde  case  in  which 
the  flux  is  always  non-zero  when  locomotion  occurs.  We  proceed  to  solve  the  eigenvalue 
problem  for  the  pressure  gradient  in  the  case  R*  <  R  <  1  by  applying  the  constraint  Q  =  0 
in  place  of  the  periodicity  constraint  (62).  The  latter  is  abandoned  because  a  solution  for 
the  pressure  gradient  inside  the  rigid  plugs  is  not  determined  as  part  of  the  Bingham  model. 
We  can  assume  that  the  periodicity  constraint  is  accounted  for  via  an  elastic  force  balance 
within  the  rigid  regions  that  we  need  not  consider  explicitly. 
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Figure  9:  Yield-surface  profiles  in  the  cases  (a)  R  =  0.2,  (b)  R  =  0.6,  (c)  R  =  0.8,  and  (d) 
R  =  0.9.  Regions  of  unyielded  fluid  are  shown  in  blue. 
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Figure  10:  Pressure-gradient  profiles  in  the  cases  (a)  R  =  0.2,  (b)  R  =  0.6,  (c)  R  =  0.8, 
and  (d)  R  =  0.9.  For  case  (d),  the  pressure  gradient  remains  undetermined  in  the  region 
ICI  >  |Cz)|  under  the  Bingham  model. 


In  the  C-region,  the  condition  of  zero  flux  implies  that 

(1  ~V+)3  +  V-(V-  ~  3)  =  0,  (74) 


using  equation  (90)  from  appendix  B.  Eliminating  r/±  in  favour  of  the  unknown  pressure 
gradient  using  equations  (71),  and  recalling  that  pg  >  0  for  |£|  <  (jj,  gives 

(. f-R )3  -  (2  PCi  +  f  +  R)  0 vet  -f-R?  =  0,  (75) 


which  is  an  algebraic  equation  for  the  pressure  gradient  pee  in  the  C-region.  We  solve  this 
cubic  equation  numerically  using  the  Matlab  routine  fzero.  From  our  solution,  we  can 
identify  two  positive  roots  if  R  <  1,  which  in  the  limit  of  R  — »  1,  are  given  by 


Pee,  ~  /  +  R  ± 


(f-R)3 

3  (f  +  R) 


1/2 


(76) 


to  leading  order.  It  can  be  verified  numerically  that  the  lesser  root  always  leads  to  an 
inconsistent  solution  with  rj-  <  0.  Thus,  we  use  the  greater  root  to  calculate  the  yield 
surface  profiles.  Our  solution  for  the  yield  surface  profile  in  the  case  R  =  0.9  is  shown 
in  figure  9(d),  and  the  corresponding  pressure  gradient  is  shown  by  plot  (d)  in  figure  10. 
Furthermore,  we  use  the  non-dimensional  form  of  equation  (51)  to  evaluate  the  locomotion 
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velocity 


(77) 


2  (<D 

U  =  -  usd(, 

k  Jo 

in  which  the  surface  velocity  is  given  by 


us  =  |ps[(l  ~V+)2  -V-] 


(pg-2i?)(2/-pg) 
2  Vi 


(78) 


which  we  have  established  using  equation  (89)  given  in  appendix  B.  We  use  the  resulting 
solution  for  U  to  extend  the  plot  in  figure  8(b)  to  the  region  R*  <  R  <  1. 

In  the  limit  R  — ►  1,  an  approximation  for  the  surface  velocity  can  be  derived  by  substi¬ 
tuting  the  leading-order  form  of  the  pressure  gradient  given  by  equation  (76)  into  equation 
(78)  to  give 

us~\(l-R-C,2)2.  (79) 

Using  this  expression  to  evaluate  the  integral  in  equation  (77)  provides 

U~  -^-(1-R)5/2,  (80) 


on  noting  that  Cd  ~  \/l  —  R  from  equation  (72).  Figure  8(b)  shows  that  this  approximation 
gives  good  agreement  with  the  exact  solution. 


The  onset  of  flux 

The  critical  value  R*  defines  the  point  at  which  the  flux  first  vanishes.  Moreover,  this  point 
corresponds  to  the  point  when  both  the  periodicity  constraint  (62)  and  the  flux  constraint 
Q  =  0  are  satisfied  simultaneously.  In  reference  to  the  profiles  shown  in  figure  8,  we  note 
that  as  R  approaches  R*  from  below,  this  critical  point  is  reached  when  the  lower  yield 
surface  p-  in  the  D-region  first  touches  the  lower  surface  such  that  p-  =  0.  Thus,  the 
pressure  gradient  in  the  D-region  at  the  critical  point  can  be  found  by  setting  r/_  =  0  in 
equation  (71)  to  give 

PD£  =  f  ~  R* i  (81) 

recalling  that  p^  <  0  for  |£|  >  The  pressure  gradient  in  the  C-region  is  given  by  the 
numerical  solution  to  (75)  described  above,  which  along  with  (81),  enters  the  continuity 
constraint  (62)  according  to 


PDZ  d(  =  0, 


(82) 


where  we  have  taken  the  integral  over  half  the  period  using  the  fact  that  the  solution  is 
symmetric  about  ((  =  0.  On  substituting  the  pressure  gradient  in  the  D-region  given  by 
equation  (81),  and  evaluating  the  second  integral,  this  equation  reduces  to 


Kd 

/  pet  dC  =  (R* 
Jo 


!)(§  -  Cd)  +  ;jsin(2CD). 


(83) 
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We  solve  the  integro-algebraic  system  defined  by  equation  (75)  with  R  =  R*  and  (83) 
numerically  by  treating  R*  as  a  shooting  parameter.  For  each  trial  in  this  method,  we  solve 
the  algebraic  equation  (75)  for  pcs  using  the  Matlab  routine  f  zero.  Our  solution  gives  the 
critical  value  R*  ~  0.888. 

A  near-exact  derivation  of  this  result  can  be  derived  semi-analytically  with  less  work  by 
making  use  of  the  fact  that  R*  ~  1,  and  recalling  that  in  the  limit  R  — >  1,  equation  (76) 
implies  that  the  pressure  gradient  in  the  C-region  is  given  approximately  by 

PCS  ~f  +  R*.  (84) 

Substituting  this  expression  into  equation  (83),  and  simplifying,  provides 

r/2/dC  +  JR*(2CB-f)«0,  (85) 

Jo 

and  hence 

f  +  iT  (2Cd-|)~0.  (86) 

Using  equation  (72)  to  eliminate  R*  in  this  equation  leaves  the  algebraic  equation 

f  +  (2Cd-§)cos2Cd  =  0,  (87) 

which  can  be  solved  numerically  for  the  solution  C^o  ~  0.343,  with  corresponding  critical 
value  R*  ~  0.887  calculated  using  equation  (72).  Given  that  this  value  is  very  close  to  the 
exact  value  derived  previously,  we  can  conclude  that  the  above  approximation  is  successful 
at  describing  the  region  of  the  parameter  space  in  which  R  is  close  to  R*. 

4  Conclusion 

We  have  investigated  theoretically  and  numerically  models  for  retrograde  and  prograde  snail 
locomotion  using  a  dynamic  formulation  that  accounts  for  both  the  elastic  and  muscular 
mechanics  of  the  snail’s  foot,  and  the  yield  stress  of  the  mucus  layer.  Our  solution  of  the 
initial- value  problem  in  the  retrograde  case  shows  an  initial  transient  that  involves  a  highly 
oscillatory  locomotion  velocity,  with  the  potential  for  extended  periods  of  no  locomotion. 
We  observe  a  transition  toward  a  steady  state  in  the  wave  frame  of  the  forcing,  which  we 
investigate  by  finding  solutions  of  the  associated  steady-state  eigenvalue  problem.  For  given 
values  of  the  non-dimensional  forcing  amplitude  and  stiffness,  we  identify  a  critical  value 
of  the  Bingham  number  above  which  there  is  no  locomotion  in  the  steady  state.  In  our 
analysis  of  prograde  locomotion,  we  find  that  the  locomotor  is  able  to  pump  fluid  in  the 
direction  it  is  travelling  only  below  a  critical  value  of  the  yield-stress-amplitude  ratio,  which 
we  have  identified  in  the  case  of  zero  stiffness.  Beyond  this  point,  the  flux  vanishes  but  the 
snail  is  still  able  to  locomote. 

A  notable  difference  between  the  two  forms  of  locomotion  we  have  studied  is  that  the  flux 
in  the  retrograde  case  is  directed  in  the  opposite  direction  to  that  of  locomotion,  while  in 
the  prograde  case,  the  snail  is  able  to  pump  fluid  in  the  direction  it  is  travelling.  Given  that 
mucus  production  is  a  substantial  chemical  cost  to  the  snail,  this  difference  may  account 
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for  why  land  snails  have  favoured  the  prograde  form  of  locomotion.  Furthermore,  we  can 
hypothesize  that  a  water  snail  predominantly  uses  ambient  water  as  a  lubricating  fluid,  and 
only  occasionally  produce  mucus  if,  for  example,  it  needs  to  climb  a  steep  incline. 

A  Regimes  in  retrograde  locomotion 

Regime  A:  (p^U  >  0,  rj±  <  0) 


U  =  \p^y{y  ~  u)  +  — ,  q  =  \UY  -  J^P^Y3 , 


2  B 

V-  =  V+  ~  rr 


W 


Regime  B:  (p^U  >0,  rj-  <  0  <  rj+  <  Y ) 


Ip^y-p+y  ( rj+<y<Y ) 

0  (0  <  y  <  r/+), 


where  the  pseudo-plug  is  stationary. 


q=1m(Y-V+)3- 


Regime  C:  (0  <  rj±  <  Y) 


(v+  <  v  <  Y), 
(v-  <  y  <  v+) 

(o  <  y  <  r/-), 


where  the  plug  velocity  is  given  by  up  = 

p2Y2  -  2 Upi  -  4 B2 


V+  2\p^Y\Pi\-2B)  ’  >1+~ 

Q  =  §P*[(X  -  ??+)3  +  V--  3 V-Y}- 


2  B 

V-  =  V+~  ] - T 


(88) 


Ip*  I 


Regime  D:  (p^U  <  0,  0  <  r/_  <  Y  <  77+) 


u  = 


v-p  +  hpdv  ~  v-)2 


(v-  <y  <  Y), 

(0  <  y  <  rj-), 
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where  the  plug  velocity  is  given  by  up  =  —  \pgryL. 


/  2  U  2  B 

V-  =  \ - 1  V+  =  V-  +  ] — | ) 

V  Pi  m 


q  =  ~  31"). 


Regime  E:  (p^U  <0,V±>  1) 


Uy 


=  \PH Av~u)  +  Q  =  \UY  —  j^p^Y3, 


lv  U  2  B 

V-  =  2Y - v  ’  r?+  =  r?-  +  r_ I' 

PiY  |pc| 


B  Regimes  in  prograde  locomotion 

The  potential  yield  surfaces  at  y  =  r/±  are  given  by 


i  Ts  -l.  R 

V±  =  1 - ±  | — r, 

Pi  m 


in  all  regimes. 

Regime  A:  ( p £  >  0,  rj±  <  0) 


u  =  \p^y2  +  (rs  -pt--  R)y, 


q  =  ~Ipz  +  \{ts  -  R). 
Regime  B:  (p^  >  0,  p-  <  0  <  p+  <  1) 


u  = 


_  /  lPi(v~v+)2  iri+  <  y  <  i), 


0  (o  <  y  <  r/+), 

where  the  pseudo-plug  is  stationary. 

Q  =  lpz(l-r)+)3. 

Regime  C:  (0  <  V±  <  1) 

up  +  \pdy - v+)2  (v+<y<  i) 

up  (h-  <  y  <  rh 

Up  +  \p${y  -  h-)2  (0  <  y  <  r/_) 

where  the  plug  velocity  is  given  by  up  =  —  ^p^rpL- 

q  =  sbdd  -  V+f  +  V-~  3 il2-}. 


(89) 


(90) 
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Regime  D:  (p^  <  0,  0  <  rj-  <  1  <  rj+) 


+  \Pi{y-p_f  (0 

where  the  plug  velocity  is  given  by  up  =  —  ^p^rj^L- 

q  =  IphV-(V-  ~  3). 


(V-  <  V  <  !), 
(0  <  y  <  V- ), 


Regime  E:  ( p £  <  0,  77±  >  1) 


u  =  bay* 2 3 4  +  (r*  -pt~  R)y> 

q  =  -\pz  +  \{ts  -  R). 
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Dynamics  of  Vorticity  Defects  in  Stratified  Slrear 

Anubhab  Roy 
October  19,  2010 


1  Introduction 

Stability  of  shear  flows  at  high  Reynolds  number  has  been  one  of  the  cornerstones  of  hy¬ 
drodynamic  stability.  A  canonical  problem  which  contains  the  basic  features  is  that  of  a 
shear  layer  -  two  streams  of  different  velocity  flowing  past  each  other  with  different  density. 
Helmholtz [18]  and  Kelvin[21]  were  probably  the  earliest  to  consider  evolution  of  distur¬ 
bances  in  a  stratified  vortex  sheet.  While  attempting  a  more  general  case  Rayleigh[27] 
approximated  the  shear  layer  by  piecewise-linear  velocity  profiles. 

A  kinematic  description  of  the  instability  for  the  unstratified  vortex  sheet  was  given  by 
Batchelorfl]  purely  based  on  vortex  dynamics  (figure  1(a)).  Imagine  perturbing  the  vortex 
sheet  by  a  sinusoidal  disturbance  so  that  the  perturbed  interface  is  located  at  rj  =  sin  kx. 
In  the  neighbourhood  of  the  nodes  (A  and  B),  the  positive  vorticity  induces  a  clockwise 
circulating  velocity  field.  If  dij/dx  >  0,  the  crest  and  trough  move  away  from  each  other, 
leading  to  vorticity  being  swept  off  from  nodes  like  A,  whereas  if  di\/dx  <  0,  the  crest 
and  trough  come  closer  to  each  other,  leading  to  vorticity  being  swept  into  nodes  like  B. 
Thus,  accumulation  of  vorticity  at  points  like  B  takes  place  unboundedly  in  the  linear, 
non-dissipative  scenario,  giving  exponential  growth.  In  absence  of  stratification  the  vortex 
sheet  problem  as  studied  by  Helmholtz  and  Kelvin  is  unstable  to  infinitely  large  wavenum¬ 
bers.  This  ill-conditioned  nature  of  the  system  gets  removed  in  Rayleigh’s  problem  with  the 
introduction  of  a  length  scale  (figure  1(b)).  Rayleigh’s  problem  consists  of  two  interfaces 
of  vorticity  discontinuity.  The  Kelvin-Helmholtz  (KH)  instability  of  a  shear  layer  was  the 


Figure  1:  (a)  Vortex  sheet  (b)  Mixing  layer 
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(a)  Mixing  Layer  -  Lasheras  and  Choi  (1988) 


(b)  The  Meeting  of  Waters  -  Confluence  of  Rio  Negro  and  Rio  SolimSes 
near  Manaus,  Brazil  to  form  the  Amazon  ( Amazon  Abyss,  BBC) 

Figure  2:  Kelvin-Helmholtz  instabilties 


earliest  demonstrations  of  how  inviscid  instabilities  can  occur  due  to  presence  of  vorticity 
extremum  in  the  flow. 

KH  instability  is  commonly  observed  in  nature.  In  atmospheric  context  clouds  often  help 
to  visualise  them.  In  figure  2  KH  instabilities  are  seen  in  a  laboratory  experiment  and  also 
in  rivers. 

Though  KH  instability  considers  the  two  layers  to  be  of  two  different  density,  for  stable 
stratification  the  role  of  gravity  is  purely  stabilising.  Taylor[32]  and  Goldstein[17]  simulta¬ 
neously  showed  how  stable  stratification  can  have  a  destabilising  role  when  it  interacts  with 
shear.  Considering  various  multi-layer  velocity /density  profiles  Taylor  studied  instabilities 
arising  out  of  interaction  of  vorticity  and/or  gravity  waves.  We  will  revisit  one  of  Taylor’s 
problems  later  in  the  analysis. 

Subsequently  to  model  a  stratified  shear  layer  relevant  for  geophysical  problems,  Holmboe[20] 
considered  Rayleigh’s  piecwise-linear  approximation  with  an  embedded  density  interface  at 
the  shear  layer  centre  (figure  3).  For  weak  stratification  a  KH  like  behaviour  was  observed, 
instabilities  stationary  with  respect  to  the  flow.  On  increasing  stratification  a  new  class  of 
instabilities  emerged  -  a  pair  of  counter  propagating  waves  known  thereafter  as  Holmboe 
waves.  Unlike  KH  instability  Holmboe  waves  are  not  easily  visualised  in  nature  due  to  the 
nonstationary  behaviour  of  the  waves.  They  have  been  experimentally  realised  and  consid¬ 
ered  to  be  present  in  exchange  flows.  Often  a  distinction  is  made  between  KH  and  Holmboe 
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Figure  3:  Stratified  shear  layer 


waves  based  on  the  stationarity  nature  as  one  moves  with  the  speed  of  the  shear  layer  centre. 
Such  a  classification  though  accurate  in  the  symmetrical  case  becomes  ambiguous  in  the 
asymmetrical  case.  Recently[8]  it  was  shown  that  a  classification  based  on  wave  interactions 
-  vorticity-vorticity  or  vorticity-gravity  -  provides  a  consistent  explanation. 

Approaches  towards  understanding  evolution  of  disturbances  in  stratified  shear  layers  can 
be  mostly  classified  in  two  categories  -  Linear  stability  calculations  and  direct  numerical 
simulations  (DNS).  While  doing  linear  stability  studies  if  one  restricts  to  piecewise  linear 
profiles  then  dispersion  relations,  eigenfunctions  can  be  analytically  computed.  To  model 
realistic  profiles  if  one  considers  the  smoother  counterparts  then  solving  the  formidable 
Taylor-Goldstein  equation  (TGE)  is  the  only  recourse.  For  a  complete  nonlinear  descrip¬ 
tion  beyond  linear  instability,  DNS  holds  the  key.  DNS  enables  a  complete  exploration 
of  wide  range  of  length  and  time  scales  in  flows  transitioning  towards  a  turbulent  state. 
Having  said  that  the  computational  cost  and  effort  involved  in  a  complete  DNS  severely 
limits  an  exhaustive  study  in  nonlinear  dynamics  of  stratified  shear  layers,  many  questions 
go  unanswered.  How  does  increasing  stratification  alter  the  nonlinear  states  in  stratified 
shear  layers?  Does  the  flow  attain  a  nonlinear  stationary  state?  Gravity  waves  interact  in 
presence  of  shear  to  give  rise  to  Taylor- Caulfield  instability,  how  does  the  nonlinear  state 
evolve?  To  answer  many  such  questions  full  nonlinear  simulations  are  needed.  Is  there  a 
reduced  description  of  the  governing  equations  which  captures  the  essence  of  full  numeri¬ 
cal  simulations  while  giving  a  stronger  analytical  handle  on  the  physics?  The  goal  of  the 
present  work  is  to  outline  a  reductive  perturbation  theory  for  stratified  shear  layers.  The 
methodology  will  rely  on  a  previously  used  technique,  the  ‘vorticity  defect’  method. 

2  ‘Defect’  theory 

Inviscid  shear  flow  instabilities  occur  in  presence  of  vorticity  maximum,  e.g.  -  KH.  For 
KH  like  flows  one  can  immediately  notice  that  the  base-flow  quantities  vary  across  a  very 
small  region.  This  is  also  true  for  stratified  shear  layers  where  besides  vorticity,  variations 
of  density  occur  over  a  small  region.  To  obtain  a  reduced  description  we  will  try  to  exploit 
the  smallness  of  the  shear-layer  thickness.  The  perturbative  technique  to  be  adopted  will 
in  spirit  be  akin  to  boundary  layer  theory.  We  will  have  an  ‘outer’  region  where  the  pertur- 
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Figure  4:  Velocity  and  density  profiles  with  ‘defects’ 


bations  will  be  assumed  to  obey  linear,  steady  and  inviscid  dynamics  and  an  ‘inner’  region 
where  nonlinearity,  dissipation  and  unsteadiness  will  be  important.  A  key  assumption  of  the 
analysis  will  be  considering  the  base-flow  quantities  (velocity,  density)  to  be  monotonically 
varying  quantity  with  sharp  variations  or  ‘defects’  of  width  e  at  the  shear  layer  centre. 

While  studying  wake  behind  a  airfoil,  Gill[16]  modelled  the  base-state  to  be  a  Couette  flow 
with  slight  distortions.  This  severely  simplified  the  linear  stability  calculations  and  provided 
integral  dispersion  relationships.  Next  Lerner  and  Knobloch[24]  performed  long-wavelength 
stability  studies  on  a  distorted  Couette  flow.  The  slight  modification  to  Couette  flow  was 
assumed  to  arise  due  to  finite  amplitude  perturbations.  But  it  wasn’t  until  the  works  of 
Balmforth  and  co-workers  that  the  ‘defect’  model  -  a  Couette  flow  with  slight  distortions 
-  got  a  strong  theoretical  foundations  and  its  immense  applications  appreciated.  Using 
an  analysis  similar  to  critical  layer  theory  del-Castillo  Negrete  et  al.  [15]  (Balmforth  et  al. 
[3])  showed  that  for  inviscid  shear  flows  one  can  reduce  2D  Euler  equations  to  a  nonlin¬ 
ear  integro-differential  equation  for  the  ‘defect’  vorticity.  Exploiting  the  ‘defect’  equations 
analogy  with  the  Valsov-Poisson  equation1  explicit  dispersion  relations  were  obtained  for 
various  distortion  imposed  on  the  Couette  flow.  Balmforth  and  Young[5]  went  ahead  and 
highlighted  evaluation  of  nonlinear  stationary  states  from  the  ‘defect’  equations.  The  lin¬ 
ear  viscous  problem  was  addressed  by  Balmforth  [6]  and  the  intriguing  connection  between 
viscous  and  inviscid  eigensolutions  was  discussed  in  context  to  various  ‘defect’  profiles.  The 
applicability  of  the  approach  for  visco-elastic  flows  has  also  been  studied  [4],  In  the  present 
work  we  will  try  to  obtain  better  understanding  of  stratified  shear  layers  aided  by  the  ‘de¬ 
fect’  theory. 

Consider  a  Bousinessq  fluid  of  density, 

p(x,  y,t)  =  pm  +  5p(x,  y,  t)  (1) 


The  evolution  equations,  vorticity-buoyancy  formulation,  in  the  Bousinessq  approximation 
can  be  written  in  the  following  non-dimensional  form, 


du  d(V,u)  dB  1  o 

dt  d(x,y)  dx  Re 

8B  d(%B)  =  1  2 

dt  d(x,  y)  ReSc 


(2) 

(3) 


1Vlasov-Poisson  equation  describes  time  evolution  of  the  distribution  function  of  plasma  consisting  of 
collision-less  charged  particles  in  the  zero  magnetic  Held  limit. 
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The  horizontal  and  vertical  coordinates  have  been  non-dimensionalised  by  a  characteristic 
length-scale  Lq  and  time  by  Uq/Lq,  Uq  being  a  velocity  scale.  Stream-function  is  non- 
dimensionalised  by  U0L0.  co  =  V2^  and  B  =  —g5p/ pmLo/UQ  are  the  non-dimensional 
vorticity  and  buoyancy  respectively.  Reynolds  number,  Re  =  UoLq/u  [y  is  the  kinematic 
viscosity),  denotes  a  ratio  of  inertial  to  viscous  forces  and  Schmidt  number,  Sc  =  v/k  (k  is 
the  mass  diffusivity) ,  provides  a  ratio  of  momentum  to  mass  diffusivity.  We  will  focus  on 
Sc  — >  oo,  the  relevant  regime  for  stratified  shear  layers2. 

As  evident  in  figure  4,  velocity  is  assumed  to  be  a  localised  ‘defect’  imposed  on  a  Couette 
flow  and  similarly  buoyancy  remains  largely  uniform  but  for  a  sharp  variation  at  the  ‘defect’ 


level.  Thus  if  the  variations  occur  over  a  length-scale  e,  we  can  write, 

^  =  ~\y2  +  e2ip(x,y,t)  (4) 

B  =  bm  +  e2&  (5) 

Hence  the  vorticity-buoyancy  equation  can  be  reduced  to, 

V2V>t  +  yV 2ipx  +  e2(ipxV2ipy  -  ipyV2ipx)  =  bx  +  i?e_1V4V>  (6) 

bt  +  ybx  +  e2(ipxby  -  ipybx)  =  ( ReSc)~lS72b  (7) 


The  above  equations  can  be  solved  using  matched  asymptotic  theory  by  splitting  the  flow 
into  an  exterior  region  where  base  flow  quantities  remain  uniform  or  vary  monotonically 
and  an  interior  region  corresponding  to  the  ‘defect’. 

2.1  Outer  solution 

The  dynamics  in  the  outer  region  is  assumed  to  be  steady,  inviscid  and  linear  in  the  leading 
order.  Another  important  assumption  made  regarding  the  outer  solution  is  that  of  weak 
stratification.  It  is  assumed  that  the  outer  solution  for  ip,  in  the  leading  order,  is  unin¬ 
fluenced  by  buoyancy.  As  would  become  evident  that  in  doing  so  we  deal  with  a  weaker 
singularity  in  the  outer  problem.  An  assumption  of  these  form  becomes  unphysical  for 
strongly  stratified  flows,  scenarios  beyond  the  scope  of  stratified  ‘defects’. 

A  regular  perturbation  expansion  of  the  solution  in  the  outer  region  provides, 

ip  =  ip0  +  eipi  +  . . .  (8) 

b  =  ebi  +  ...  (9) 

Thus  the  leading  order  dynamics  is  governed  by  the  following  pair  of  equations. 

yV2V’0rr  =  0,  ybix  =  0  (10) 

=>  VVo  =  -2  A(x,t)S(y)  (11) 

Thus  the  outer  solution  behaves  like  an  irrotational  flow  being  forced  by  a  vortex  sheet,  a 
5  function  singularity,  at  the  ‘defect’  level  -  y  =  0  (buoyancy  also  gets  forced  by  a  density 

2In  the  context  of  passive  scalar  turbulence  Sc  — >  oo  is  known  as  the  Batchelor  regime.  Such  a  regime 
allows  for  rapid  variations  in  passive  scalar  field  due  to  stirring  by  a  smooth  velocity  field. 
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sheet).  This  necessitates  a  jump  in  tangential  velocity  (while  maintaining  continuity  of 
normal  velocity), 

[V’oi/ljlj-  =  -2  A(x,t),  \MVyS-  =  0  (12) 

As  will  be  seen  this  jump  in  the  outer  solution  will  be  smoothened  out  over  a  region  e  by  the 
inner  solution.  If  we  had  allowed  for  leading  order  buoyancy  effects  in  the  outer  vorticity 
equation,  the  flow  would  have  been  forced  by  a  vortex  dipole  instead  of  a  vortex  sheet.  The 
consequence  of  these  stronger  singularity,  a  S'  singularity,  would  have  then  resulted  in  a 
jump  in  normal  velocity.  Weak  stratification  allows  us  to  overlook  such  a  possibility  in  the 
present  analysis. 

Introducing  Fourier  transform  of  the  streamfunction  as, 

poo 

4>(k,y,t)  =  J7['il>(x,y,t)\=  ip(x,y,t)e~lkxdx  (13) 


equation  11  can  be  written  as, 


Thus  one  obtains, 


4>o yy  -  k2i>0  =  -2 A(k,t)5(y) 


V>o  =  J7-1 


g{k)A(k,t) 


(14) 


(15) 


where,  J-  1  denotes  the  inverse  Fourier  transform  and  the  Green’s  function,  Q(k),  can  be 
computed  for  both  unbounded  and  bounded  domains  (|y|  ^  1)  as, 

bounded  (16) 

unbounded  (17) 


Q(k)  =  j^j-sech|fc|  sinh  |A:|(1  -  \y\), 


Henceforth  all  analysis  will  be  presented  for  the  unbounded  case  but  the  results  have  been 
extended  to  the  bounded  case  too.  The  ‘defect’  analysis  is  not  solely  restricted  to  the  outer 
flow  being  Couette  and  can  be  extended  to  more  general  flows  by  the  method  of  constructing 
Green’s  function  for  flow  profiles  with  curvature[2]. 

Next  we  will  regularise  the  discontinuity  in  outer  velocity  field  by  evaluating  the  inner 
solution. 


2.2  Inner  solution 

Similar  to  a  traditional  boundary-layer  like  analysis,  the  inner  solution  must  contain  at  least 
one  of  the  physics  that  have  been  overlooked  in  the  outer  region  -  viscosity,  nonlinearity 
and  unsteadiness.  Working  with  our  small  parameter,  the  defect  thickness  e,  we  define  a 
stretched  vertical  coordinate,  77  =  y/e  and  a  slow  time-scale  r  =  et.  It  needs  to  be  mentioned 
that  the  method  of  analysis  bears  similarities  with  standard  critical  layer  theories  ([33], [31]). 
Beyond  the  initial  transients  the  0(e2)  nonlinear  term  acting  on  a  critical  layer  of  width 
0(e)  becomes  important  after  time  0(l/e).  As  would  become  clear  that  the  ‘defect’  theory 
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is  considerably  simpler  than  critical  layer  analysis.  We  pose  the  following  expansion  in  the 
inner  variables, 


ip  =  ipo(x,  0,  t)  +  €(pi(x,  rj,  t )  +  0(e2),  (18) 

Ipy  =  <Plr)  +  €<l>2ri  +  0(e 2),  (19) 

=r-  V  Ip  =  6  (plryri  T  'P’Oxx  T  <p2r]ri  T  0(f)  (20) 

6  =  #(x,77,t)  +  0(e)  (21) 


From  above  equations  it  is  evident  that  leading  order  inner  solution,  tpo(x,  0,  t),  matches  the 
corresponding  outer  solution  at  y  =  0.  As  mentioned  before  the  discontinuity  in  tangential 
velocity  in  the  outer  solution  will  be  smoothed  by  the  inner  solution, 

/OO 

Zdrj  (22) 

-OO 

where  Z  =  </>iw  is  the  leading  order  defect  vorticity.  The  velocity  jump  equals  the  vorticity 
integrated  across  the  defect. 

Substituting  the  expansion  18  in  equations  6  and  7  and  collecting  the  leading  order  terms 
one  obtains  the  ‘defect’  vorticity-buoyancy  equations, 

ZT  +  r\Zx  +  <&xZ-q  =  Bx  +  AZ,m,  (23) 

Bt  +  7]Bx  +  <&xBr)  =  (24) 

(. Z,t3 )  — >  0,  as  \rj\  — >  oo 

i  r  oo  i  roo 

i  =  (25) 

where,  'H(-)  denotes  the  Hilbert  transform  of  a  variable  defined  in  terms  of  the  following 
Cauchy  principal  value  integral, 

H(u)  =  1^  (26) 

and  A  =  (e3i?e)”1  is  representative  of  the  ratio  of  nonlinearity  and  viscosity.  In  critical  layer 
theory  there  exists  a  similar  quantity,  Haberman  parameter,  which  denotes  competition  of 
nonlinearity  and  viscosity. 

Equations  23-25  are  the  ‘defect’  vorticity-buoyancy  equations  relevant  for  stratified  shear 
layer  studies.  During  the  course  of  reduction  we  have  retained  the  necessary  physics  due 
to  unsteadiness,  nonlinearity  and  dissipation.  Though  the  obtained  nonlinear  integro- 
differential  equations  appear  more  daunting  than  the  governing  equations  we  started  off 
with  it  would  become  clear  that  the  ‘defect’  equations  are  more  amenable  to  analytical 
and  numerical  treatments.  Similar  equations  have  been  derived  in  critical  layer  theories  for 
stratified  shear  flows  [14]. 
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3  Conservation  laws 


The  inviscid,  nonlinear  ‘defect’  equations  (equation  23-24  for  A  =  0)  contains  certain  con¬ 
served  quantities,  some  deducible  from  evident  symmetries  in  the  problem  and  others  not 
so  obvious  ones.  Lets  us  define  a  spatial  average, 


<  .  >= 


In  absence  of  dissipation  both  average  vorticity  and  buoyancy  remains  conserved, 


<  Z  >r=  0,  <  B  >T=  0 


(27) 


momentum  remains  conserved, 

<  rjZ  >T=  0 

and  energy  also  is  conserved, 

n2  —  4? 

<  -L^z-VB  >T=  0 


(28) 

(29) 


As  always  is  the  case  with  ideal  fluid  besides  the  above  ones  there  exists  an  infinitude  of 
conserved  quantities  -  Casimirs.  For  2D  homogeneous  fluid  any  functional  of  vorticity  is  a 
conserved  quantity.  Enstrophy  -  a  representative  of  vortex  stretching  and  tilting  -  is  one 
such  commonly  used  candidate.  For  stratified  flows  due  to  baroclinic  generation  of  vorticity 
enstrophy  is  no  longer  conserved.  For  stratified  ‘defects’  one  instead  has  the  any  functional 
of  buoyancy  and  product  of  vorticity  with  any  buoyancy  functional  as  the  Casimirs, 


<  G(B)  >T=  0,  <  ZF(B)  >r=  0 


(30) 


The  conserved  quantities  thus  obtained  help  in  obtaining  the  stationary  states  for  the  system 
and  also  comment  on  its  nonlinear  stability.  More  importantly  it  also  helps  as  a  check  in 
numerical  computations  to  ensure  that  the  numerical  method  indeed  conserve  the  necessary 
invariants. 


4  Inviscid  linear  stability 

Earlier  it  was  mentioned  that  previous  studies  on  flow  with  slight  distortions  exploited  the 
simplicity  of  the  model  in  deriving  explicit  integral  dispersion  relations.  Conventionally 
inviscid  linear  stability  demands  solution  of  Rayleigh  equation  or  in  the  case  of  stratified 
shear  flow  Taylor  Goldstein  equation  (TGE), 

d 2  _  21  |  _  v"j>  _  n 

dy 2  (U  —  c)  (U  -  c)2 

where  is  the  disturbance  streamfunction,  U  the  background 
the  BruntVaisala  (buoyancy  time-scale),  k  is  the  disturbance  wave-number  and  c  is  the 
wave-speed.  The  relative  strength  of  stratification  vis-a-vis  inertia  is  denoted  by  Richardson 
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number,  Ri  =  (N Lq/Uq)2 .  Solution  of  the  above  boundary  value  problem,  even  numeri¬ 
cally,  is  no  easy  task  and  analytical  solutions  can  be  obtained  only  for  broken-line  profiles. 
The  ‘defect’  model  circumvents  this  difficulty  with  ease. 

Consider  the  ‘defect’  vorticity  and  buoyancy  to  be  decomposed  of  a  base-state  contribution 
and  a  perturbation, 


Z(x,  ??,  r)  =  F{rj)  +  ((x,  rj,  r) 
B(x,t],t)  =  J  N2(y)dy  +  P(x,r],T) 


(32) 

(33) 


Substituting  the  above  expressions  in  equations  23  and  24  and  considering  the  linearised 
equations  one  has, 


Cr  T  7]Cx  4“  fx^i)  —  fixi  (34) 

fir  +  vfix  +  rxN2  =  0  (35) 


On  assuming  a  normal-mode  form,  C(X>V ,  r)  =  C( r])eik(x  CT\  the  eigenfunctions  can  be 
expressed  as, 


b  = 
c  = 


fN 2 
(V  ~  c)  ’ 
fFv  fN 2 

( rj  —  c)  ( r ]  —  c)2 


(37) 

(38) 


and  the  dispersion  relation, 


T>(c,  k;  F,  N)  =  2k 


En.  |  Z 

( r]  —  c)  (rj  —  c)2 


dy  =  0 


(39) 


This  is  the  stratified  version  of  the  integral  dispersion  relation  obtained  in  [3].  T>  is  an 
analytic  function  of  c  but  for  the  real- axis  where  it  has  a  branch-cut.  The  branch-cut  gives 
rise  to  the  continuous  spectrum,  a  common  feature  of  inviscid  shear  flows.  The  presence 
of  the  continuous  spectrum  contributes  to  transient  growth  in  flows  which  otherwise  are 
modally  stable  -  Orr  mechanism  being  a  well  known  example.  Sometimes  the  dispersion 
relation  instead  of  having  zeros  in  the  principal  complex  plane,  vanishes  at  points  on  dif¬ 
ferent  Riemann  sheets.  While  doing  the  initial  value  problem  this  leads  to  observation  of  a 
collective  behaviour  of  the  continuous  spectrum  as  a  single  damped  discrete  mode  -  known 
as  Landau  pole  or  quasi-mode.  This  damping,  eponymously  known  as  Landau  damping  in 
plasma  physics,  is  a  purely  inviscid  phenomenon  arising  due  to  phase  mixing.  Quasi-modes 
have  been  probed  to  understand  the  mysterious  connection  between  the  inviscid  and  vis¬ 
cous  initial  value  problem  in  the  limit  of  vanishing  viscosity.  The  present  analysis  will  focus 
purely  on  unstable  modes  and  hence  no  longer  address  the  above  intriguing  phenomena. 
We  will  now  compute  dispersion  relations  for  various  stratified  shear  layer  configurations. 
Before  proceeding  on  to  calculations  for  both  broken  and  smooth  profiles  a  note  can  be 
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made  borrowing  from  Nyquist  theory.  For  a  closed  curve  C  in  the  complex  c  plane,  the 
winding  number, 


dc  =  N-P 


(40) 


2?n,  Jc  V(c) 

where  N  is  the  number  of  zeros  and  P  gives  the  number  of  poles  of  D(c)  inside  C.  Considering 
V  an  analytic  function  of  c  the  winding  number  gives  us  the  number  of  eigenmodes.  Earlier 
works  on  ‘defect’  theory  like  [3]  and  [6]  elucidates  how  Nyquist  theory  can  be  used  effectively 
for  the  ‘defect’  model. 


4.1  Holmboe  instability 

Holmboe  instability  occurs  when  there  exists  a  density  interface  embedded  in  a  mixing  layer. 
It  is  a  popular  model  to  represent  instabilities  arising  due  to  horizontal  shearing  of  density 
interfaces.  This  classical  instability,  in  the  symmetrical  case,  exhibits  a  transition  from 
the  stationary  Kelvin-Helmholtz  instability  to  the  propagating  Holmboe  waves.  Though 
Holmboe  waves  are  inherently  difficult  to  visualise  in  nature  due  to  their  non-stationary 
behaviour  they  have  been  observed  in  laboratory  experiments  -  exchange  flow  between  two 
basins  of  different  density  being  an  example. 


4.1.1  Broken-line  profile 

For  a  broken-line  Holmboe  profile  denoted  by  the  following  defect  vorticity  and  buoyancy 
profile, 

Fv  =  fl5(v  ~  Vo)  -  ${v  +  Vo)], 

N2  =  d5(rj) 

one  obtains  the  dispersion  relation, 

c4  -  (i  -  {  +  c2  +  ^  =  °,  r)o  =  l,c  =  cr  +  ici  (43) 

It  was  previously  mentioned  that  the  ‘defect’  theory  is  suited  to  weak  stratification  cal¬ 
culations.  Thus  the  results  obtained  are  representative  of  small  Ri  cases  in  the  complete 
problem,  d  determines  the  strength  of  stratification  and  a  qualitative  comparison  is  made 
of  contours  of  growth-rate  in  d  —  k  plane  with  the  one  obtained  from  the  solution  of  TGE  for 
a  symmetric  shear  layer.  The  basic  features  are  well  captured  by  the  present  model  (figure 
5).  One  immediate  observation  is  the  differentiation  of  stationary  (cr  =  0)  and  propagating 
instabilities  (cr  /  0). 


(41) 

(42) 


4.1.2  Smooth  profile 


For  smooth  profiles  the  integral  dispersion  relation  can  be  evaluated  using  Cauchy  residue 
theorem.  The  simplest  profile  to  consider  would  be  a  Lorentzian. 


Fv  = 
N2  = 


af 

TT 

ad 


1 


( r]  +  l)2  +  a2 
1 

tt  rj2  +  a2 


1 

(?y  —  l)2  +  a2  ’ 


(44) 

(45) 
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Figure  5:  Holmboe  instability,  a)  A  broken-line  profile,  b)  Contours  of  growth-rate  (cj)  in 
d  —  k  plane  (/  =  2),  c)  Growth-rate  contours  in  Ri  —  k  plane  from  a  complete  linear  stability 
calculations  (Carpenter  et  al.  [8]) 


For  the  Lorentzian  profile  the  dispersion  relation  is  identical  to  equation  43  with  c  in  the 
quadratic  expression  no  longer  being  the  complex  wave-speed  but  instead  replaced  by  c  — > 
c  +  iasgn(cj). 

A  smooth  profile  which  would  be  used  in  the  numerical  calculations  and  represents  sharper 
variations  in  flow  quantities  is  a  tanh  profile. 


tanh 


N 2 


_d_ 

2A 


sech2 


V  ~  Vo\ 

A  ) 


tanh 


V  +  Vo\ 

A  ) 


(46) 

(47) 


One  could  note  that  the  both  the  smooth  profiles  chosen  above  are  functions  which  are 
nascent  delta  functions,  i.e.  as  a,  A  — >  0  we  obtain  the  generalised  functions  form  as  given 
in  equation  42. 

From  residue  calculus  we  can  evaluate  the  following  integrals, 


Xi 


fC 

l  (A,  ?7o,  c)  =  / 

J  —  C 

fC 

j(A,  7/0,  c)  =  / 

J  —  c 


sech2 


2  (  V  ~  Vo 


sech2 


iv-c) 

2  (  V  -  Vo 


dr i  =  2iri 


sech2  s  -  r2^(1)(«) 


7T 


(r)  -  cf 


dr]  =  — 


27 ri 

A 


2sech2 


2  (  c  -  Vo 


tanh 


c  -  ??o 
A 


s  +  4r^(2)(a) 


7T 


1  i  1 

a  =  2  ~  vrA^°  _  S=2^1  +  Sgn^)) 


where  ^'(1)(a)  and  ^^2)(a)  are  polygamma  functions  -  ^n\z)  =  dn+l (\ogT (z)) / dzn+1 . 
Based  on  the  above  evaluated  integrals  the  dispersion  relation  for  the  tanh  Holmboe  profile 
can  be  computed  explicitly  as, 

2k  ~  AA  [/I2 i(A,7/0,c)  -Xi(A,-77o,c)}  +  dX2(A,0,c)]  =  0  (48) 
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Figure  6:  Holmboe  instability,  a)  A  Smooth  profile,  b)  Variation  of  cr  (dashed)  and  q 
(continuous)  with  stratification  (d)  for  /  =  2,  k  =  1,  A  =  1/3,  r/o  =  1 

Though  we  have  obtained  an  analytical  dispersion  relation  for  the  tanh  profile  the  numerical 
evaluation  of  the  integrals  is  more  efficient  than  working  with  the  polygamma  functions. 

Figure  6  shows  the  stationary  and  propagating  nature  of  the  instabilities  for  a  set  of 
parameter  values.  Cases  from  the  instability  curves  will  be  later  visited  during  numerics. 
An  important  question  that  numerics  will  try  to  answer  is  how  does  the  nonlinear  states 
change  as  one  increases  stratification  and  transitions  from  a  IvH  instability  to  Holmboe 
instability. 

4.2  Taylor-Caulfield  instability 

Holmboe  instability  arose  when  gravity  waves  interacted  with  vorticity  waves.  Taylor [32] 
during  the  course  of  studying  various  broken  line  profiles  observed  that  a  stable  stratification 
composed  of  two  density  jumps  gets  destabilised  if  there  is  an  imposed  shear.  This  initially 
surprising  result  can  be  understood  as  interaction  of  gravity  waves  facilitated  by  shear. 

4.2.1  Broken-line  profile 

For  Taylor-Caulfield  instabilities  there  is  no  defect  vorticity  and  instead  one  has  variations 
of  defect  buoyancy, 


N2  =  d[5{r)  -  770)  +  5(r)  +  770)]) 


(49) 


As  seen  in  the  case  of  Holmboe  instability  one  obtains  polynomial  dispersion  relations  for 
broken-line  profiles, 


c 


4 


c2  + 


7/0  —  1,  C  —  Cy  T 


(50) 
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Figure  7:  Taylor-Caulfield  instability,  (a)  A  Smooth  profile,  (b)  Growth-rate  (cj)  variation 
with  stratification  (d)  for  k  =  1, r/o  =  1 


4.2.2  Smooth  profile 

Similar  to  Holmboe  case  one  can  calculate  stability  characteristics  for  various  smooth  pro¬ 
files.  For  a  Lorentzian  profile, 

(77  +  l)2  +  a2  (77  —  l)2  +  a2  ^  ^ 

The  dispersion  relation  is  identical  to  equation  50  with  c  in  the  quadratic  expression  no 
longer  being  the  complex  wave-speed  but  instead  replaced  by  c  — >  c  +  iasgn(cj). 

For  a  smooth  tanh  profile, 


7 r 


N2 


2A 


sech2 


V  Vo\ 

A  ) 


+  sech2 


V  +  r?0\ 

A  J 


(52) 


Once  again  the  analytical  form  of  the  dispersion  relation  for  can  be  written  as, 

2k  -  ^-[X2(A,77o ,c)  +X2(A,  -770,  c)]  =  0  (53) 

Figure  7  illustrates  the  stability  characteristics  for  Taylor-Caulfield  instability  as  obtained 
from  ‘defect’  theory.  The  TC  modes  are  purely  stationary  waves. 

Though  there  exists  some  linear  stability  calculations  and  DNS  calculations  very  little  is 
known  about  the  nonlinear  states  of  Taylor-Caulfield  instabilities.  Next  using  numerical 
solution  of  ‘defect’  equations  we  will  try  to  explore  the  details. 


5  Numerical  solution 

A  significant  advantage  that  the  ‘defect’  theory  offers  over  the  complete  Bousinnessq  Navier- 
Stokes  equations  is  in  numerical  computation.  As  is  commonly  known  that  the  notorious 
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advective  nonlinearity  in  Navier-Stokes  serves  as  a  severe  bottleneck  in  full-fledged  compu¬ 
tations.  The  defect  equations  for  Sc  — >  oo, 

ZT  +  r]Zx  +  <&XZV  =  Bx  +  A  (54) 

Br  +  qBx  +  *&xBri  =  0,  (55) 

(Z,  B)  — »•  0,  as  1 77I  — >  00 

1  poo  1  poo 

*  ^n(*)x  =  --j  Zdr,  (56) 

though  nonlinear  avoids  the  difficult  nonlinearity  9(<^>i,  (j>ir]ri)/d(x,  q)  =  2).  In  the 

present  form  the  ‘defect’  equation  entails  solving  a  set  of  advection-diffusion  equations. 
The  method  adopted  in  the  present  work  relies  on  exploiting  the  similarities  of  the  ‘defect’ 
equations  with  the  Vlasov-Poisson  equation  and  using  a  operator-splitting  technique. 

5.1  Operator  splitting  technique 

The  operator  splitting  technique  used  is  based  on  the  method  outlined  in  [13]  for  solution  of 
Vlasov-Poisson  equation  and  in  [7]  for  studying  critical  layers  in  2D  vortices.  This  splitting 
technique  is  also  known  as  Strang  splitting[25]. 

The  integration  over  time-step  [r,  r  +  5t\  is  divided  into  three  stages, 

5.1.1  Advect  in  x  for  half  time-step 

Consider  the  advection  in  x  in  the  buoyancy  equation, 

BT  +  qBx  =  0  (57) 

for  a  time  interval  Sr /2.  The  exact  solution  is  B(x,  q,  t  +  St/2)  =  B(x  —  r/dr/2,  r/,  t).  Fourier 
interpolation  is  used  to  perform  the  x  advection  -  B(k,i],r  +  5t / 2)  =  B(k,  77,  T)e-?fcr?151"/2. 
For  the  vorticity  equation, 

£r  +  VZX  =  Bx  (58) 

the  solution  can  be  similarly  written  in  Fourier  space  as  - 

Z(k,  rj,T  +  St/ 2)  =  Z(k,  q,  T)e~lkriST^2  +  ikSr/2  B(k,  q,  r  +  St/ 2) 

The  above  calculations  in  Fourier  space  is  done  using  the  FFT /IFFT  routine  in  Matlab. 

5.1.2  Advect  in  q  for  a  complete  time-step 

Both  buoyancy  and  vorticity  obeys  identical  q  advection  equation, 

BT  +  <bxBn  =  0  (59) 

Before  performing  the  advection  is  computed  by  integrating  Z  in  q,  once  again  in  Fourier 
space.  The  q  advection  occurs  with  a  velocity  -  B(x,  q,T  +  St)  =  B(x,  q  —  &xSt,  t).  Due 
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to  lack  of  periodicity  we  use  linear/cubic  spline  interpolation  in  rj  instead  of  Fourier.  The 
same  method  is  followed  for  advection  of  Z.  If  advection  happens  into  the  domain  from 
both  above  or  below  the  computational  domain  then  it  is  assigned  the  far-held  equilibrium 
value. 


5.1.3  Advect  in  x  for  another  half  time-step 

The  first  step  is  repeated  for  another  half  time-step,  St/2.  One  can  combine  the  first  and 
third  step  and  do  two  steps  instead  -  advect  in  q  for  St  followed  by  advection  in  x  for 
another  St. 


To  account  for  diffusion  of  vorticity  (A  /  0)  a  forward  in  time  Euler  integration  is  car¬ 
ried  out  with  diffusion  operator  in  q  discretized  using  finite-difference  technique. 


The  computational  domain  consists  of  (x,  rj)  G  [0,  2n\  x  [—q maxiVmaxl-  Vmax  is  fixed  at 
10  so  that  the  vertical  extent  is  at  least  5  times  larger  than  the  shear  layer  and  the  far-held 
is  ensured  to  remain  uninfluenced  by  the  dynamics  in  the  shear  layer.  The  choice  of  time- 
step,  St,  and  horizontal  discretization,  h  =  2 n/N,  ( N  is  the  number  of  Fourier  modes  used) 
is  determined  by  the  following  stability  condition, 

\Vmax\AT  %  h  (61) 


For  the  present  work  N  has  been  chosen  to  either  256  or  512  and  the  time-step,  At  = 
1  x  1CF4,  5  x  1CT4.  The  vertical  discretization,  A  rj,  is  chosen  so  as  to  obey  the  von  Neumann 
stability  criteria, 


AAt  1 
(A q)2  ^  2 


(62) 


A q  =  0.01,0.005  is  used  in  the  calculations.  A,  an  estimate  for  momentum  diffusion,  has 
been  widely  varied  from  0.05  —  10~4.  All  numerics  reported  hereafter  are  for  A  =  10-3. 
To  instigate  instabilities  in  the  hows  the  base-state  is  initially  seeded  with  a  disturbance 
of  the  form  e~v  cos  kx  of  magnitude  0.1%  of  the  base-state  maxima.  The  q  interpolation 
has  been  done  using  linear  interpolation.  The  interpl  command  in  Matlab  has  been  used 
for  the  purpose.  For  stratihed  KH  (section  5.3.1)  a  grid  resolution  of  256  x  2001  has  been 
used  with  a  time-stepping  of  5  x  10~4.  The  remaining  calculations  have  been  done  on  a 
512  x  4001  grid  with  At  =  1  x  10~4. 


5.2  Analytical  comparisons 

To  check  the  efficacy  of  the  operator  splitting  technique  a  comparison  is  made  between  the 
numerical  solution  and  analytical  solution  known  for  a  passive  scalar  advection  equation. 
The  passive  scalar  advection  equation, 

Xt  +  VXx  +  2Xv  cos  2x  =  0  (63) 

was  studied  by  Stewartson[31]  in  the  context  of  critical  layers.  The  above  equation  can  be 
solved  using  method  of  characteristics  and  drawing  analogy  with  motion  of  pendulum.  For 
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Figure  8:  Comparison  of  analytical  and  numerical  results  for  the  passive  scalar  problem  at 
x  =  3tt/4.  x  is  x(3tt/4,  77, 0)  =  1/(1  +  rf) 


x  =  7t/4  or  37r/4  one  can  obtain  explicit  expressions  for  x-  At  x  =  37r/4  if  initially  the 
distribution  of  x  is  x(3tt/4,  77,  0)  =  f{rf)  then  the  solution  at  arbitrary  times  will  be[7], 
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(64) 


where  dn  is  the  Jacobi  elliptic  function  and  K(m)  the  complete  elliptic  integral  of  first 
kind.  Figure  8  shows  the  good  agreement  between  numerical  and  analytical  results  for  a 
Lorentzian  initial  condition.  The  agreement  starts  deteriorating  with  time  due  to  generation 
of  increasing  fine-scales. 


5.3  Holmboe  instabilities 


By  increasing  stratification  we  will  try  to  explore  various  nonlinear  dynamics  for  instability 
scenarios  occurring  in  Holmboe  instabilities  starting  from  stratified  KH  and  proceeding  on 
to  pure  Holmboe  waves.  For  the  purpose  of  exploring  various  nonlinear  states  the  tanh 
model  discussed  before  is  revisited. 
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/  =  2 ,k  =  1,A  =  1/3,770  =  1  are  the  parameters  maintained  identical  for  all  Holmboe 
simulations.  We  choose  3  values  of  d.  d  =  0.1, 1.5  8.  As  visible  from  figure  6,  d  =  0.1 

denotes  stationary  stratified  KH  waves  in  linear  regime,  d  =  1.5,  8  denote  linear  propagating 
Holmboe  waves  (d  =  8  is  not  shown  in  the  figure).  We  will  try  to  find  how  the  nonlinear 
states  behave  for  these  cases.  Does  a  linear  Holmboe  wave  remain  so  always  howsoever 
weak  the  stratification  may  be? 
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5.3.1  Stratified  KH  ( d  =  0.1) 

Kelvin-Helmholtz  in  presence  of  weak  stratification  contains  all  the  usual  traits  of  the  un¬ 
stratified  version  -  growth  of  instability  waves,  roll-up  of  the  shear  layer,  vortex  nutation 
(the  horizontal  rocking  of  partially  form  cat’s  eye)  and  finally  culminating  with  the  cele¬ 
brated  cat’s  eye  pattern  or  the  KH  billow.  Besides  the  weakened  growth-rate  due  to  the 
stabilising  effect  of  stratification  the  stratified  version  exhibits  intense  stirring  and  mixing 
of  the  buoyancy  field.  3D  effects  like  elliptical  instability  of  the  billow  and  hyperbolic  in¬ 
stability  of  the  braid  also  emerge  later  in  the  evolution.  Interested  reader  can  find  detailed 
discussions  in  the  article  by  Caulfield  and  Peltier  [12]. 

Figure  9  shows  the  evolution  of  buoyancy  (B)  and  vorticity  (Z)  field  at  different  instants 
of  time.  Figure  9(a)-(b)  is  a  snapshot  when  the  flow  exhibits  finite  amplitude  waves  just 
prior  to  the  roll-up.  (c)-(d)  is  at  a  stage  during  the  formation  of  the  billow.  The  flow  has 
departed  from  linear  stability  predictions  and  vortex  nutations  are  observed  at  this  stage. 
Finally  in  figure  9(e)-(f)  the  well  homogenised  KH  billow  is  visible.  Figure  10  shows  the 
time-evolution  of  <  |^|2  >1/2,  a  measure  of  disturbance  amplification.  Signatures  of  the 
features  discussed  above  can  be  seen  in  the  time-trace. 

5.3.2  Pure  Holmboe  ( d  =  8) 

Next  we  explore  a  case  where  the  flow  is  buoyancy  dominant  and  the  shear  layer  dynamics 
are  very  different  from  KH.  For  strongly  stratified  shear  layers  the  flow  exhibits  propagating 
Holmboe  instabilities.  They  are  identified  by  oppositely  travelling  cusps  which  spews  fluid 
from  one  region  to  other.  Compared  to  a  KH  billow  the  Holmboe  waves  are  highly  inefficient 
in  mixing.  The  dynamics  is  very  often  given  by  the  beating  phenomena  of  the  two  Holmboe 
waves.  Laboratory  experiments  on  Holmboe  waves  have  been  done  by  Lawrence  et  al. 
[22],  Pouliquen  et  al.[26]  and  several  researchers  thereafter.  Efforts  towards  using  DNS  to 
understand  Holmboe  wave  dynamics  was  done  by  Smyth  and  co-workers  ([28],  [29]  and 
[30])  amongst  others.  A  recent  work  by  Carpenter  et  al. [9]  considers  both  experimental  and 
numerical  simulation  aspects  of  the  problem. 

From  figure  11  we  observe  that  the  ‘defect’  numerics  are  adept  at  capturing  the  well-known 
traits  of  Holmboe  instability.  Linear  stability  predicts  presence  of  2  unstable  modes  of  same 
growth-rate  but  opposite  phase-speed.  As  expected  the  numerical  solutions  do  exhibit  the 
beating  phenomena  between  these  2  waves.  As  visible  clearly  from  figure  ll(c)-(d)  onwards 
both  the  buoyancy  and  vorticity  field  has  presence  of  elongated  cusp-like  structures  and 
one  can  see  both  waves  before  moving  past  each  other.  Even  at  a  much  later  time  (figure 
ll(e)-(f))  the  cusps  refuse  to  disappear.  If  one  takes  a  look  at  the  evolution  of  disturbance 
amplitude,  <  |V’|2  >1^2,  (figure  12)  the  persistent  beating  pattern  is  immediately  obvious. 
One  cannot  also  help  but  notice  in  figure  11(e)  the  inefficient  nature  of  mixing  of  the 
buoyancy  field  by  Holmboe  waves. 

A  qualitative  comparison  of  the  buoyancy  field  during  Holmboe  instability  computed  using 
‘defect’  numerics  with  those  obtained  from  experiments  and  3D  DNS3  [9]  can  be  seen  in 
figure  13.  Though  the  experiments  and  numerics  are  not  in  the  exact  parameter  regime 
as  the  ‘defect’  simulation,  it  is  reaffirming  to  observe  the  present  calculations  ability  to 

3  Both  experiments  and  numerics  were  done  for  Reynolds  number,  Re=630,  Richardson  number  being  0.3 
and  Prandtl  number  (temperature  equivalent  to  Sc)  kept  at  700  for  experiment  and  25  for  numerics. 
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Figure  9:  Vorticity  and  buoyancy  fields  at  different  instants  of  time  for  stratified  KH  insta¬ 
bility,  /  =  2,  d  =  0.1,  k  =  1,  A  =  1/3, 770  =  1. 
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Figure  10:  Evolution  of  disturbance  amplitude  for  stratified  KH.  Cj  =  0.9035 
reproduce  the  principal  signature  of  the  shear  layer  with  its  cuspy  arms  as  visible  in  others. 


5.3.3  Holmboe-KH  transition  ( d  =  1.5) 

Will  the  persistent  counter-propagating  cusps  be  present  irrespective  of  how  weak  the  strat¬ 
ification  is  as  long  as  linear  stability  predicts  them?  Simulations  for  d  =  1.5  as  seen  in  figure 
14  answers  in  negative,  d  =  1.5  is  away  from  the  critical  stratification  ( dcrn  =  0.5)  required 
for  transition  from  KH  to  Holmboe  as  predicted  by  linear  stability.  Cusps  get  formed  as 
expected  but  as  visible  in  figure  14(a)-(b)  the  cusps  while  moving  past  each  other  interact 
nonlinearly  and  lock  each  other.  Later  (figure  14(c)-(d))  these  locked  waves  overturn  lead¬ 
ing  to  stirring  of  the  buoyancy  field  leading  towards  a  KH  billow-like  stage  ((e)-(f)).  One 
can  find  reference  to  such  nonlinear  locking  and  subsequent  overturning/mixing  in  the  ex¬ 
periments  of  Hogg  and  Ivey [19].  It  is  encouraging  to  see  that  the  reductive  model  obtained 
can  capture  such  intricate  nonlinear  dynamics  at  very  low  computational  cost. 


5.4  Taylor-Caulfield  (TC) 

Instability  due  to  interaction  of  gravity  waves  via  shear  was  one  of  the  intriguing  aspects  of 
Taylor’s  (1931)  analysis.  It  belied  conventional  belief  that  one  needed  an  vorticity  maxima  in 
the  flow  or  unstable  stratification  to  induce  instability  in  stratified  shear  flow.  Moreover  the 
observations  of  staircase  formation  in  shear  flows  provides  a  strong  motivation  to  understand 
how  do  multilayer  density  profiles  interacts  with  shear.  Though  Taylor  introduced  the 
problem  as  a  canonical  stratified  shear  intability  it  wasn’t  until  through  a  series  of  papers 
by  Caulfield  and  co-workers  (linear  stability [10],  experiments[ll]  and  DNS[23])  that  a  great 
deal  was  known  about  the  interaction  of  multilayer  density  profiles  with  unbounded  shear 
and  mixing  layer.  The  present  section  will  be  focused  on  unbounded  shear  while  mixing 
layer  will  be  reserved  for  the  next  section. 

The  profile  chosen  for  instability  corresponds  to  the  one  already  dealt  before  in  the  linear 
stability  study  - 
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(e)  B,t  =  75.5  (f)2,t  =  75.5 


Figure  11:  Vorticity  and  buoyancy  fields  at  different  instants  of  time  for  pure  Holmboe 
instability,  /  =  2,d  =  8,fc  =  l,A  =  l/3,r/o  =  l. 
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Figure  12:  Evolution  of  disturbance  amplitude  for  pure  Holmboe  instability,  q  =  0.4005 


(a)  ‘Defect’  theory 


(b)  Experimental  results 


(c)  3D  DNS 


Figure  13:  Qualitative  comparison  of  buoyancy  field  as  obatined  from  ‘defect’  theory  and 
that  of  both  experiments  and  DNS  (Carpenter  et  al.  [9] ) 


375 


{e)B,t  =  62.5  (f)2,t  =  62.5 


Figure  14:  Vorticity  and  buoyancy  fields  at  different  instants  of  time  for  Holmboe-KH 
transition,  /  =  2,  d  =  1.5,  k  =  1,  A  =  1/3,  r/o  =  1. 
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Figure  15:  Evolution  of  disturbance  amplitude  for  Holmboe-KH  transition. Cj  =  0.5537 


for  the  parameters  d  =  5 ,k  =  1,  A  =  1/3,  r/o  =  2.  The  base-state  has  been  seeded  with 
purely  buoyancy  perturbation  and  vorticity  gets  produced  purely  due  to  baroclinic  effects. 
During  the  linear  instability  stage  the  baroclinically  generated  vorticity  from  the  initial 
conditions  gets  accumulated  at  the  density  interfaces  and  amplifies  (figure  16(a)-(b)).  Once 
the  stationary  TC  waves  get  nonlinearly  saturated  the  buoyancy  forms  a  billow  which  too 
exhibits  nutation  (figure  16(c),  (d)  and  (f)).  The  billow  formation  in  TC  instability  does 
not  involve  the  extensive  overturning  as  observed  in  stratified  KH  but  is  a  more  efficient 
mixing  process  than  pure  Holmboe  waves  (figure  16(e)). 


5.5  TC-Holmboe  transition 


Finally  let  us  consider  a  3-layer  density  profile  embedded  inside  a  mixing  layer.  The  extent 
of  the  density  layer  is  taken  to  be  smaller  than  that  of  the  mixing  layer  (figure  17). 
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Simulation  has  been  carried  out  for  /  =  2,d  =  4,  k  =  1,A  =  1/3, 771  =  1,772  =  2.  Linear 
stability  predicts  3  unstable  modes  -  a  faster  growing  stationary  TC  mode  and  a  pair  of 
slowly  growing  propagating  Holmboe  wave.  Figure  18(a)-(b)  shows  the  formation  of  a  billow 
in  the  initial  stages  which  starts  spewing  out  cusps  along  its  cusps  ((c)-(d)).  This  cuspy 
billow  persists  and  once  again  a  typical  Holmboe  feature  -  beating  amplitude  pattern  (figure 
19)  -  can  be  seen.  Thus  if  one  were  to  do  an  experiment  on  such  a  profile,  a  propagating 
instability  would  be  observed  despite  linear  stability  making  predictions  for  a  stationary 
billow.  Exploration  of  such  interesting  scenarios  with  great  ease  was  made  possible  by 
virtue  of  working  with  the  ‘defect’  equations. 


6  Nonlinear  stationary  states 

For  stratified  KH  and  TC  instabilities  the  time-trace  for  <  |?/|2  >1/2  seems  attain  a  sta¬ 
tionary  state  but  for  a  viscous  decay  (figure  10  and  16(f)).  A  qualitative  argument  for  a 
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(e)  Streamwise  averaged  instantaneous  buoy-  (f)  Evolution  of  disturbance  amplitude  for  Taylor- 
ancy  profile  (•  =  f  •  dx)  at  different  instants  Caulfield  instability,  a  =  0.5804 
of  time 


Figure  16:  Vorticity  and  buoyancy  fields  at  different  instants  of  time  for  Taylor-Caulfield 
instability,  d  =  5,  k  =  1,  A  =  1/3,  r/o  =  2. 
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Figure  17:  Density  layer  embedded  in  a  mixing  layer  (TC-Holmboe  transition) 


stationary  state  also  receives  support  when  one  observes  the  buoyancy  and  vorticity  plots 
at  the  final  time  computationally  attained  (figure  9(f)  and  16(c)).  A  time  evolution  of  the 
plots  show  that  they  are  remarkably  static.  Can  one  provide  quantitative  confirmations  to 
such  guesses? 

Let  us  consider  the  vorticity-buoyancy  defect  equation  under  inviscid,  steady4  assumptions, 


—  $x 
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i]Bx  +  $xB,n  =  0 
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The  exact  solutions  of  the  above  equations  are, 

B  =  T{£ ), 

(72) 

Z  =  Q(£)  +  r(P' {£)  =  Q(£)  -  Bv, 

(73) 

(tj  —  c)^ 

where,  £  — - b  $ 

’  2 

c  being  the  wave-speed  of  the  state.  V  and  Q  are  arbitrary  functionals  of  the  total  stream- 
function,  £.  Thus  any  nonlinear  stationary  state  needs  to  obey  the  above  functional  re¬ 
lationship.  We  go  ahead  and  try  applying  this  diagnostic  on  both  stratified  KH  and  TC 
instabilities  (c  =  0  in  both  cases). 

At  every  instant  of  time  besides  Z  and  B  the  total  streamfunction,  £,  is  also  computed. 
Thus  at  every  grid  point  (xj,  rjk)  values  of  B,  Z  &  £  are  known.  We  construct  pair  of  points 
-  (£ik,Bik)  and  (£&,  (Z  +  Bv)ik)-  Next  all  the  (£,£>)[(£,  ( Z  +  Bv))\  pairs  are  plotted  in  the 
B  —  £[(Z  +  Bv)  —  £]  planes.  If  there  indeed  exists  a  functional  relationship  then  the  scatter 
of  points  should  collapse  on  to  a  curve.  That  would  be  a  confirmation  that  the  obtained 
solution  is  indeed  approaching  a  nonlinear  stationary  state. 

First  it  is  applied  on  stratified  KH  (section  5.3.1).  As  visible  from  figure  20  the  initial 
functional  dependence  gets  broken  and  there  is  a  prominent  scatter  during  an  intermediate 
time  -  an  instant  of  strong  instabilities  (figure  20(c)-(d))  -  before  approaching  towards  a 
curve  (figure  20(e)-(f)).  The  near  horizontal  arm(s)  in  both  the  buoyancy  and  vorticity 
plots  corresponds  to  the  uniform  region  outside  the  shear  layer.  From  the  expression  of  the 

4If  the  state  is  moving  with  a  constant  wave-speed  c  then  a  co-moving  reference  frame  is  considered. 
Though  not  presented  here  such  scenario  was  observed  with  simulations  for  asymmetric  stratified  shear 
layer.  One  of  the  Holmboe  waves  is  then  preferred  and  the  system  translates  with  a  constant  speed. 
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Figure  18:  Vorticity  and  buoyancy  fields  at  different  instants  of  time  for  TC-Holmboe 
transition,  /  =  2,  d  =  4,  k  =  1,  A  =  1/3,  rji  =  1,  r]2  =  2. 
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Figure  19:  Evolution  of  disturbance  amplitude  for  TC-Holmboe  transition,  c*  =  0.807 


total  stream-function,  £,  one  can  see  why  it  should  be  large  and  negative  outside  the  shear- 
layer.  The  vertical  region,  which  hosts  most  of  the  dynamics  corresponds  to  the  defect. 
Also  the  locations  from  where  points  seem  to  spew  out  vertically,  represent  the  hyperbolic 
points  in  the  flow.  In  the  last  available  buoyancy  plot,  figure  20(e),  the  bulbous  head  is 
representative  of  the  cat’s  eye  region  of  the  flow.  As  visible  from  figure  9(e)  the  B  field  has 
not  homogenised  completely  and  there  is  signature  of  a  dipole-like  structure  in  the  cat’s 
eye,  which  contributes  to  the  bulge. 

As  seen  from  figure  21  similar  observations  are  also  true  for  a  Taylor- Caulfield  case  (section 
5.4). 

Having  identified  the  nonlinear  structures  the  nonlinear  stability  of  the  stationary  states 
can  be  investigated.  If  we  look  for  stationary  points  of  the  energy  under  the  constraints 
that  it  has  to  obey  -  the  conserved  quantities  determined  previously  (section  3)  -  we  have 
the  following  ‘Hamiltonian’  (we  defer  from  calling  the  system  Hamiltonian  since  we  have 
not  checked  for  the  Poisson  bracket  structure  for  it), 

2  t  2 

IK  =<  -  VB  -  cr]Z  +  F(B)  +  ^Z  +  ZG(B)  >  (74) 

where  <  .  >  denotes  average  over  entire  space  as  defined  before.  The  stationary  points  of 
the  system  are, 


^=0 
5Z  ’ 

5ft 
~5B  ~ 


£  =  G(B),  ^B  =  V{£) 
rj-F'(B) 


=>  z  = 


G'(B) 


=  Q(£)  +  r,V'{£) 


(75) 

(76) 


confirming  what  was  mentioned  in  the  beginning  of  this  section.  Here  5ft/ 5Z  is  the  func¬ 
tional  derivative  of  the  functional  ft.  To  check  stability  properties  we  need  to  check  the 
second  variation  ft. 


52ft  =  <  -5ip5Z  +  {F"(B)  +  Zg"(B)}(5B)2  +  2 Q'(B)5Z5B  > 

=  <{5Z  +  g\B)5B}2  +  {-5^5Z  -  (5Z)2}  +  {JF"(B)  +  ZQ"{B)  -  ( g'(B))2}(5B )2  > 


381 


In  this  form  a  comment  cannot  be  made  about  nonlinear  stability  as  S2cK  is  indefinite.  But 
if  f  dxdrj^Sil’dZ  —  ( SZ )2}  >  0  is  positive  definite  then  the  condition  for  stability  will  be  - 

{T"(B)  +  Zg"(B)  -  ( Q'(B ))2}  >  0 


Now  the  question  is  under  what  cases  is  the  assumption,  f  dxdr]{—8^>5Z  —  ( 5Z )2}  >  0,  true. 
In  ‘defect’  theory,  for  an  unbounded  domain,  ^  and  Z  can  be  connected  in  Fourier  space 
as, 

’^  =  ~2 \j  Z{k,ri,t)dri 


V>( k 


Thus  using  Parseval’s  theorem  we  have 


J  dx  d7i{—8ip5Z  —  (SZ)2}  >  0 

[  dkdrj{-8ip6Z*  -  \SZ\2}  >  0 


f  1 

f 

[2k 

/  d  r/SZ 

-  /  d7j\SZ\2  }  >  0 


(78) 

(79) 

(80) 


Presently  a  detailed  study  in  the  above  aspects  is  lacking  but  it  is  hypothesized  that  there 
would  exist  a  band  of  wavenumbers  in  which  a  stratified  shear  flow,  modelled  by  ‘defect’ 
theory,  will  be  stable.  For  higher  wavenumbers  one  can  always  find  instability. 

7  Conclusion 

Exploiting  the  sharp  variations  present  in  stratified  shear  layers  which  occurs  over  small 
regions,  ‘defect’  vorticity-buoyancy  equations  have  been  derived.  This  equation  maintains 
all  the  necessary  physics  -  unsteadiness,  nonlinearity  and  viscosity.  Linear  stability  cal¬ 
culations  using  the  ‘defect’  equations  becomes  significantly  convenient  with  the  present 
equations.  But  the  big  strength  of  ‘defect’  equations  was  found  to  be  in  doing  nonlinear 
calculations  for  various  stratified  shear  layer  profiles.  Unlike  conventional  numerics  done 
for  Bousinessq  Navier-Stokes  equations  the  ones  for  ‘defect’  equations  can  be  done  with 
minimum  computational  effort. 

We  have  been  able  to  explore  some  interesting  cases  concerning  nonlinear  evolution  of  strat¬ 
ified  shear  layers.  Though  a  great  deal  is  known  about  the  stratified  KH  instability,  we  were 
able  to  address  some  intriguing  questions  regarding  the  transition  from  KH  instability  to 
Holmboe  waves  and  also  whether  a  Holmboe  instability  persists  nonlinearly  with  decreasing 
stratification.  Very  little  is  known  about  the  nonlinear  dynamics  of  Taylor-Caulfield  insta¬ 
bilities.  An  effort  has  been  made  to  probe  TC  instability  in  greater  detail  while  working 
with  ‘defect’  equations.  Coexistence  of  a  TC  billow  with  a  Holmboe  cusp  was  also  observed 
while  studying  density  layers  embedded  in  mixing  layer. 

A  diagnostics  was  used  with  success  to  identify  nonlinear  stationary  structures  in  stratified 
KH  and  TC  instabilities.  Motivated  by  visual  arguments  a  check  was  done  on  the  buoyancy, 
vorticity  and  total  streamfunction  field  to  determine  if  the  flow  is  attaining  an  invariant 
form.  Efforts  to  perform  nonlinear  stability  have  so  far  been  inconclusive. 
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1.4 


Figure  22:  Time  evolution  of  the  scalar  variance 


Being  built  for  flow  with  sharp  interfaces  or  local  distortions,  the  ‘defect’  theory  has  immense 
potential  for  stratified  shear  layer  studies.  TC  instabilities  is  far  from  being  well-understood 
and  it  will  be  very  interesting  to  explore  a  wide  range  of  parameter  space  armed  with  ‘defect’ 
theory.  An  aspect  completely  overlooked  in  the  present  analysis  is  the  estimation  of  mixing 
in  the  considered  flows.  Since  Sc  — >  oo  was  the  domain  of  interest  an  estimate  of  stirring 
was  made  based  on  the  scalar  variance,  <  |VR|2  >.  A  low  value  of  <  |V£>|2  >  denote 
homegenisation  of  the  scalar  field.  Figure  22  does  indeed  highlight  an  already  discussed 
aspect,  based  on  visual  grounds,  of  mixing/stirring  efficiency  being  maximum  for  stratified 
KH,  followed  by  TC  and  then  Holmboe.  A  more  careful  analysis  needs  to  be  done  in  this 
regard  and  comparisons  made  with  existing  DNS  and  experimental  results. 

The  ‘defect’  equations  are  amenable  to  analytical  approaches  more  easily  than  the  govern¬ 
ing  equations  from  which  they  have  been  derived.  We  haven’t  really  used  this  to  obtain 
the  solutions  in  the  nonlinear  states  in  presence  of  dissipation.  A  quasi-steady  critical  layer 
theory  approach  is  being  investigated  under  the  framework  of  ‘defect’  equations  to  answer 
the  asymptotic  form  of  the  nonlinear  stationary  states.  Finally  a  potential  future  work 
which  has  been  left  out  from  the  present  study  is  the  study  on  the  viscous  spectrum,  issue 
of  quasimodes  and  transient  growth. 
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